Dataset statistics
| Number of variables | 44 |
|---|---|
| Number of observations | 6575956 |
| Missing cells | 1802992 |
| Missing cells (%) | 0.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.2 GiB |
| Average record size in memory | 352.0 B |
Variable types
| Numeric | 17 |
|---|---|
| Categorical | 27 |
CAE_ESTADO has constant value "1.0" | Constant |
POS_FECHA_POSTULACION has a high cardinality: 601550 distinct values | High cardinality |
IES_NOMBRE_INSTIT has a high cardinality: 245 distinct values | High cardinality |
CAR_NOMBRE_CARRERA has a high cardinality: 410 distinct values | High cardinality |
CANTON has a high cardinality: 78 distinct values | High cardinality |
PARROQUIA has a high cardinality: 113 distinct values | High cardinality |
CAM_NOMBRE_CAMPUS has a high cardinality: 175 distinct values | High cardinality |
Unnamed: 0 is highly overall correlated with PRD_ID_NUM_POSTULACION | High correlation |
INS_ID is highly overall correlated with INI_ID and 8 other fields | High correlation |
INI_ID is highly overall correlated with INS_ID and 8 other fields | High correlation |
CAE_NOTA_POSTULA is highly overall correlated with NOTA_POSTULA | High correlation |
POS_ID is highly overall correlated with INS_ID and 7 other fields | High correlation |
CUS_ID is highly overall correlated with INS_ID and 7 other fields | High correlation |
NOTA_POSTULA is highly overall correlated with CAE_NOTA_POSTULA | High correlation |
PRD_ID_NUM_POSTULACION is highly overall correlated with Unnamed: 0 and 1 other fields | High correlation |
IES_ID is highly overall correlated with IES_TIPO_IES | High correlation |
OFA_ID is highly overall correlated with INS_ID and 7 other fields | High correlation |
APC_ID is highly overall correlated with INS_ID and 7 other fields | High correlation |
CCP_ID is highly overall correlated with INS_ID and 7 other fields | High correlation |
CAR_ID is highly overall correlated with IES_TIPO_IES and 1 other fields | High correlation |
MODALIDAD_ID is highly overall correlated with MODALIDAD and 2 other fields | High correlation |
AREA_ID is highly overall correlated with AREA_NOMBRE and 1 other fields | High correlation |
SUBAREA_ID is highly overall correlated with AREA_NOMBRE and 1 other fields | High correlation |
PER_ID is highly overall correlated with INS_ID and 8 other fields | High correlation |
INS_POBLACION is highly overall correlated with INS_TIPO_INSCRIPCION | High correlation |
INS_TIPO_INSCRIPCION is highly overall correlated with INS_ID and 2 other fields | High correlation |
SEGMENTO_ASPIRANTE is highly overall correlated with CAE_GRUPO | High correlation |
CAE_GRUPO is highly overall correlated with PER_ID and 1 other fields | High correlation |
IES_TIPO_IES is highly overall correlated with IES_ID and 2 other fields | High correlation |
IES_TIPO_FINANCIAMIENTO is highly overall correlated with PRD_ID_SEGMENTO and 1 other fields | High correlation |
MODALIDAD is highly overall correlated with MODALIDAD_ID and 2 other fields | High correlation |
JORNADA_ID is highly overall correlated with MODALIDAD_ID and 3 other fields | High correlation |
JORNADA is highly overall correlated with MODALIDAD_ID and 3 other fields | High correlation |
NIVEL is highly overall correlated with IES_TIPO_IES | High correlation |
AREA_NOMBRE is highly overall correlated with AREA_ID and 2 other fields | High correlation |
SUBAREA_NOMBRE is highly overall correlated with CAR_ID and 3 other fields | High correlation |
PROVINCIA is highly overall correlated with CANTON | High correlation |
CANTON is highly overall correlated with JORNADA_ID and 2 other fields | High correlation |
PRD_ID_SEGMENTO is highly overall correlated with IES_TIPO_FINANCIAMIENTO and 1 other fields | High correlation |
SEGMETO_CARRERA is highly overall correlated with IES_TIPO_FINANCIAMIENTO and 2 other fields | High correlation |
archivo is highly overall correlated with INS_ID and 9 other fields | High correlation |
SEGMENTO_ASPIRANTE is highly imbalanced (56.6%) | Imbalance |
CAE_GRUPO is highly imbalanced (61.8%) | Imbalance |
POS_ESTADO is highly imbalanced (> 99.9%) | Imbalance |
IES_TIPO_IES is highly imbalanced (59.1%) | Imbalance |
IES_TIPO_FINANCIAMIENTO is highly imbalanced (89.4%) | Imbalance |
IES_ESTADO is highly imbalanced (99.9%) | Imbalance |
MODALIDAD is highly imbalanced (66.3%) | Imbalance |
NIVEL is highly imbalanced (78.2%) | Imbalance |
PRD_ID_SEGMENTO is highly imbalanced (90.4%) | Imbalance |
SEGMETO_CARRERA is highly imbalanced (93.4%) | Imbalance |
INS_POBLACION has 1390432 (21.1%) missing values | Missing |
Reproduction
| Analysis started | 2023-03-10 07:31:37.575840 |
|---|---|
| Analysis finished | 2023-03-10 07:46:15.119031 |
| Duration | 14 minutes and 37.54 seconds |
| Software version | pandas-profiling v3.6.6 |
| Download configuration | config.json |
Unnamed: 0
Real number (ℝ)
| Distinct | 861282 |
|---|---|
| Distinct (%) | 13.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 267908.12 |
| Minimum | 1 |
|---|---|
| Maximum | 861282 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 50.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 19453 |
| Q1 | 101652 |
| median | 217683 |
| Q3 | 394023 |
| 95-th percentile | 670846 |
| Maximum | 861282 |
| Range | 861281 |
| Interquartile range (IQR) | 292371 |
Descriptive statistics
| Standard deviation | 204594.32 |
|---|---|
| Coefficient of variation (CV) | 0.76367344 |
| Kurtosis | -0.30705975 |
| Mean | 267908.12 |
| Median Absolute Deviation (MAD) | 134983 |
| Skewness | 0.77808933 |
| Sum | 1.761752 × 1012 |
| Variance | 4.1858834 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 21 | < 0.1% |
| 461 | 21 | < 0.1% |
| 453 | 21 | < 0.1% |
| 454 | 21 | < 0.1% |
| 455 | 21 | < 0.1% |
| 456 | 21 | < 0.1% |
| 457 | 21 | < 0.1% |
| 458 | 21 | < 0.1% |
| 459 | 21 | < 0.1% |
| 460 | 21 | < 0.1% |
| Other values (861272) | 6575746 |
| Value | Count | Frequency (%) |
| 1 | 21 | |
| 2 | 21 | |
| 3 | 21 | |
| 4 | 21 | |
| 5 | 21 | |
| 6 | 21 | |
| 7 | 21 | |
| 8 | 21 | |
| 9 | 21 | |
| 10 | 21 |
| Value | Count | Frequency (%) |
| 861282 | 1 | |
| 861281 | 1 | |
| 861280 | 1 | |
| 861279 | 1 | |
| 861278 | 1 | |
| 861277 | 1 | |
| 861276 | 1 | |
| 861275 | 1 | |
| 861274 | 1 | |
| 861273 | 1 |
INS_ID
Real number (ℝ)
| Distinct | 970233 |
|---|---|
| Distinct (%) | 14.8% |
| Missing | 4317 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9631397.1 |
| Minimum | 7006528 |
|---|---|
| Maximum | 12266030 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 50.2 MiB |
Quantile statistics
| Minimum | 7006528 |
|---|---|
| 5-th percentile | 7185591 |
| Q1 | 8275071 |
| median | 9476282 |
| Q3 | 11358329 |
| 95-th percentile | 11892310 |
| Maximum | 12266030 |
| Range | 5259502 |
| Interquartile range (IQR) | 3083258 |
Descriptive statistics
| Standard deviation | 1553580.3 |
|---|---|
| Coefficient of variation (CV) | 0.16130373 |
| Kurtosis | -1.2437661 |
| Mean | 9631397.1 |
| Median Absolute Deviation (MAD) | 1313842 |
| Skewness | 0.016919285 |
| Sum | 6.3294065 × 1013 |
| Variance | 2.4136117 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12263583 | 18 | < 0.1% |
| 11404388 | 18 | < 0.1% |
| 11421690 | 18 | < 0.1% |
| 11861922 | 18 | < 0.1% |
| 11696132 | 18 | < 0.1% |
| 11588166 | 18 | < 0.1% |
| 11995611 | 18 | < 0.1% |
| 12037948 | 18 | < 0.1% |
| 11733932 | 18 | < 0.1% |
| 11618976 | 18 | < 0.1% |
| Other values (970223) | 6571459 | |
| (Missing) | 4317 | 0.1% |
| Value | Count | Frequency (%) |
| 7006528 | 5 | < 0.1% |
| 7006530 | 5 | < 0.1% |
| 7006532 | 15 | |
| 7006534 | 3 | < 0.1% |
| 7006536 | 7 | |
| 7006538 | 15 | |
| 7006540 | 3 | < 0.1% |
| 7006542 | 10 | |
| 7006544 | 5 | < 0.1% |
| 7006546 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 12266030 | 7 | < 0.1% |
| 12265994 | 5 | < 0.1% |
| 12265986 | 18 | |
| 12265981 | 3 | < 0.1% |
| 12265977 | 9 | |
| 12265974 | 5 | < 0.1% |
| 12265962 | 5 | < 0.1% |
| 12265961 | 5 | < 0.1% |
| 12265960 | 2 | < 0.1% |
| 12265959 | 5 | < 0.1% |
INI_ID
Real number (ℝ)
| Distinct | 970231 |
|---|---|
| Distinct (%) | 14.8% |
| Missing | 4317 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5715046.1 |
| Minimum | 3941098 |
|---|---|
| Maximum | 7723593 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 50.2 MiB |
Quantile statistics
| Minimum | 3941098 |
|---|---|
| 5-th percentile | 4030626 |
| Q1 | 4803508 |
| median | 5622128 |
| Q3 | 6794408 |
| 95-th percentile | 7068742 |
| Maximum | 7723593 |
| Range | 3782495 |
| Interquartile range (IQR) | 1990900 |
Descriptive statistics
| Standard deviation | 1029845.3 |
|---|---|
| Coefficient of variation (CV) | 0.18019896 |
| Kurtosis | -1.1716456 |
| Mean | 5715046.1 |
| Median Absolute Deviation (MAD) | 856436 |
| Skewness | -0.092666549 |
| Sum | 3.755722 × 1013 |
| Variance | 1.0605814 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7459572 | 18 | < 0.1% |
| 6877165 | 18 | < 0.1% |
| 6833413 | 18 | < 0.1% |
| 7053573 | 18 | < 0.1% |
| 6970637 | 18 | < 0.1% |
| 6916683 | 18 | < 0.1% |
| 7422449 | 18 | < 0.1% |
| 7479712 | 18 | < 0.1% |
| 6989531 | 18 | < 0.1% |
| 6932056 | 18 | < 0.1% |
| Other values (970221) | 6571459 | |
| (Missing) | 4317 | 0.1% |
| Value | Count | Frequency (%) |
| 3941098 | 5 | < 0.1% |
| 3941099 | 5 | < 0.1% |
| 3941100 | 15 | |
| 3941101 | 3 | < 0.1% |
| 3941102 | 7 | |
| 3941103 | 15 | |
| 3941104 | 3 | < 0.1% |
| 3941105 | 10 | |
| 3941106 | 5 | < 0.1% |
| 3941107 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 7723593 | 7 | |
| 7723589 | 5 | < 0.1% |
| 7723588 | 15 | |
| 7723585 | 3 | < 0.1% |
| 7723584 | 6 | < 0.1% |
| 7723581 | 9 | |
| 7723580 | 5 | < 0.1% |
| 7723575 | 9 | |
| 7723571 | 5 | < 0.1% |
| 7723570 | 13 |
PER_ID
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 50.2 MiB |
| 22 | |
|---|---|
| 19 | |
| 21 | |
| 20 | |
| 18 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 13151912 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 22 |
|---|---|
| 2nd row | 22 |
| 3rd row | 22 |
| 4th row | 22 |
| 5th row | 22 |
Common Values
| Value | Count | Frequency (%) |
| 22 | 1620447 | |
| 19 | 1401656 | |
| 21 | 1308865 | |
| 20 | 1200308 | |
| 18 | 1044680 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 22 | 1620447 | |
| 19 | 1401656 | |
| 21 | 1308865 | |
| 20 | 1200308 | |
| 18 | 1044680 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 5750067 | |
| 1 | 3755201 | |
| 9 | 1401656 | 10.7% |
| 0 | 1200308 | 9.1% |
| 8 | 1044680 | 7.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 13151912 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 5750067 | |
| 1 | 3755201 | |
| 9 | 1401656 | 10.7% |
| 0 | 1200308 | 9.1% |
| 8 | 1044680 | 7.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13151912 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 5750067 | |
| 1 | 3755201 | |
| 9 | 1401656 | 10.7% |
| 0 | 1200308 | 9.1% |
| 8 | 1044680 | 7.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13151912 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 5750067 | |
| 1 | 3755201 | |
| 9 | 1401656 | 10.7% |
| 0 | 1200308 | 9.1% |
| 8 | 1044680 | 7.9% |
INS_POBLACION
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1390432 |
| Missing (%) | 21.1% |
| Memory size | 50.2 MiB |
| No escolar | |
|---|---|
| Escolar | |
| Escolar rezagado | 371 |
Length
| Max length | 16 |
|---|---|
| Median length | 10 |
| Mean length | 8.8711854 |
| Min length | 7 |
Characters and Unicode
| Total characters | 46001745 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No escolar |
|---|---|
| 2nd row | No escolar |
| 3rd row | No escolar |
| 4th row | No escolar |
| 5th row | No escolar |
Common Values
| Value | Count | Frequency (%) |
| No escolar | 3233246 | |
| Escolar | 1951907 | |
| Escolar rezagado | 371 | < 0.1% |
| (Missing) | 1390432 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| escolar | 5185524 | |
| no | 3233246 | |
| rezagado | 371 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 8419141 | |
| a | 5186266 | |
| r | 5185895 | |
| s | 5185524 | |
| c | 5185524 | |
| l | 5185524 | |
| 3233617 | 7.0% | |
| e | 3233617 | 7.0% |
| N | 3233246 | 7.0% |
| E | 1952278 | 4.2% |
| Other values (3) | 1113 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 37582604 | |
| Uppercase Letter | 5185524 | 11.3% |
| Space Separator | 3233617 | 7.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 8419141 | |
| a | 5186266 | |
| r | 5185895 | |
| s | 5185524 | |
| c | 5185524 | |
| l | 5185524 | |
| e | 3233617 | 8.6% |
| z | 371 | < 0.1% |
| g | 371 | < 0.1% |
| d | 371 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 3233246 | |
| E | 1952278 |
Space Separator
| Value | Count | Frequency (%) |
| 3233617 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 42768128 | |
| Common | 3233617 | 7.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 8419141 | |
| a | 5186266 | |
| r | 5185895 | |
| s | 5185524 | |
| c | 5185524 | |
| l | 5185524 | |
| e | 3233617 | 7.6% |
| N | 3233246 | 7.6% |
| E | 1952278 | 4.6% |
| z | 371 | < 0.1% |
| Other values (2) | 742 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 3233617 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 46001745 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 8419141 | |
| a | 5186266 | |
| r | 5185895 | |
| s | 5185524 | |
| c | 5185524 | |
| l | 5185524 | |
| 3233617 | 7.0% | |
| e | 3233617 | 7.0% |
| N | 3233246 | 7.0% |
| E | 1952278 | 4.2% |
| Other values (3) | 1113 | < 0.1% |
INS_TIPO_INSCRIPCION
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Memory size | 50.2 MiB |
| 1.0 | |
|---|---|
| 3.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 19675182 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3.0 |
|---|---|
| 2nd row | 3.0 |
| 3rd row | 3.0 |
| 4th row | 3.0 |
| 5th row | 3.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 5258059 | |
| 3.0 | 1300335 | 19.8% |
| (Missing) | 17562 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 5258059 | |
| 3.0 | 1300335 | 19.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 6558394 | |
| 0 | 6558394 | |
| 1 | 5258059 | |
| 3 | 1300335 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 13116788 | |
| Other Punctuation | 6558394 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6558394 | |
| 1 | 5258059 | |
| 3 | 1300335 | 9.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6558394 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 19675182 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 6558394 | |
| 0 | 6558394 | |
| 1 | 5258059 | |
| 3 | 1300335 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19675182 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 6558394 | |
| 0 | 6558394 | |
| 1 | 5258059 | |
| 3 | 1300335 | 6.6% |
SEGMENTO_ASPIRANTE
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Memory size | 50.2 MiB |
| POBLACION GENERAL | |
|---|---|
| POLITICA DE ACCION AFIRMATIVA | |
| IES PARTICULAR | 22513 |
| MERITO TERRITORIAL | 6440 |
| GAR | 2600 |
Length
| Max length | 29 |
|---|---|
| Median length | 17 |
| Mean length | 21.596246 |
| Min length | 3 |
Characters and Unicode
| Total characters | 141636687 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | POBLACION GENERAL |
|---|---|
| 2nd row | POBLACION GENERAL |
| 3rd row | POBLACION GENERAL |
| 4th row | POBLACION GENERAL |
| 5th row | POBLACION GENERAL |
Common Values
| Value | Count | Frequency (%) |
| POBLACION GENERAL | 4006717 | |
| POLITICA DE ACCION AFIRMATIVA | 2520124 | |
| IES PARTICULAR | 22513 | 0.3% |
| MERITO TERRITORIAL | 6440 | 0.1% |
| GAR | 2600 | < 0.1% |
| (Missing) | 17562 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| poblacion | 4006717 | |
| general | 4006717 | |
| politica | 2520124 | |
| de | 2520124 | |
| accion | 2520124 | |
| afirmativa | 2520124 | |
| ies | 22513 | 0.1% |
| particular | 22513 | 0.1% |
| merito | 6440 | < 0.1% |
| territorial | 6440 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 20668120 | |
| I | 16671683 | |
| O | 13066562 | |
| 11596042 | ||
| C | 11589602 | |
| E | 10568951 | |
| L | 10562511 | |
| N | 10533558 | |
| R | 6600227 | 4.7% |
| P | 6549354 | 4.6% |
| Other values (9) | 23230077 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 130040645 | |
| Space Separator | 11596042 | 8.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 20668120 | |
| I | 16671683 | |
| O | 13066562 | |
| C | 11589602 | |
| E | 10568951 | |
| L | 10562511 | |
| N | 10533558 | |
| R | 6600227 | 5.1% |
| P | 6549354 | 5.0% |
| T | 5082081 | 3.9% |
| Other values (8) | 18147996 |
Space Separator
| Value | Count | Frequency (%) |
| 11596042 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 130040645 | |
| Common | 11596042 | 8.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 20668120 | |
| I | 16671683 | |
| O | 13066562 | |
| C | 11589602 | |
| E | 10568951 | |
| L | 10562511 | |
| N | 10533558 | |
| R | 6600227 | 5.1% |
| P | 6549354 | 5.0% |
| T | 5082081 | 3.9% |
| Other values (8) | 18147996 |
Common
| Value | Count | Frequency (%) |
| 11596042 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 141636687 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 20668120 | |
| I | 16671683 | |
| O | 13066562 | |
| 11596042 | ||
| C | 11589602 | |
| E | 10568951 | |
| L | 10562511 | |
| N | 10533558 | |
| R | 6600227 | 4.7% |
| P | 6549354 | 4.6% |
| Other values (9) | 23230077 |
CAE_GRUPO
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Memory size | 50.2 MiB |
| POBLACION GENERAL | |
|---|---|
| /POLITICA DE ACCION AFIRMATIVA | |
| /POBLACION GENERAL | |
| POLITICA DE ACCION AFIRMATIVA | |
| /MERITO TERRITORIAL/POLITICA DE ACCION AFIRMATIVA | 7799 |
| Other values (14) | 12356 |
Length
| Max length | 53 |
|---|---|
| Median length | 17 |
| Mean length | 22.099962 |
| Min length | 3 |
Characters and Unicode
| Total characters | 144940255 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | POBLACION GENERAL |
|---|---|
| 2nd row | POBLACION GENERAL |
| 3rd row | POBLACION GENERAL |
| 4th row | POBLACION GENERAL |
| 5th row | POBLACION GENERAL |
Common Values
| Value | Count | Frequency (%) |
| POBLACION GENERAL | 3333684 | |
| /POLITICA DE ACCION AFIRMATIVA | 2167099 | |
| /POBLACION GENERAL | 681162 | 10.4% |
| POLITICA DE ACCION AFIRMATIVA | 356294 | 5.4% |
| /MERITO TERRITORIAL/POLITICA DE ACCION AFIRMATIVA | 7799 | 0.1% |
| /MERITO TERRITORIAL | 6440 | 0.1% |
| POLITICA DE ACCION AFIRMATIVA/MERITO TERRITORIAL | 2220 | < 0.1% |
| GAR | 1730 | < 0.1% |
| GAR/POLITICA DE ACCION AFIRMATIVA | 783 | < 0.1% |
| /GAR | 698 | < 0.1% |
| Other values (9) | 485 | < 0.1% |
| (Missing) | 17562 | 0.3% |
Length
| Value | Count | Frequency (%) |
| poblacion | 4014846 | |
| general | 4014846 | |
| de | 2534508 | |
| accion | 2534508 | |
| afirmativa | 2532147 | |
| politica | 2525754 | |
| merito | 14239 | 0.1% |
| territorial | 8833 | < 0.1% |
| territorial/politica | 7888 | < 0.1% |
| gar | 2444 | < 0.1% |
| Other values (7) | 3508 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 20722667 | |
| I | 16737585 | |
| O | 13132204 | |
| 11635127 | ||
| C | 11618370 | |
| E | 10597696 | |
| L | 10580921 | |
| N | 10564218 | |
| R | 6619988 | 4.6% |
| P | 6549372 | 4.5% |
| Other values (9) | 26182107 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 130430413 | |
| Space Separator | 11635127 | 8.0% |
| Other Punctuation | 2874715 | 2.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 20722667 | |
| I | 16737585 | |
| O | 13132204 | |
| C | 11618370 | |
| E | 10597696 | |
| L | 10580921 | |
| N | 10564218 | |
| R | 6619988 | 5.1% |
| P | 6549372 | 5.0% |
| T | 5119197 | 3.9% |
| Other values (7) | 18188195 |
Space Separator
| Value | Count | Frequency (%) |
| 11635127 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 2874715 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 130430413 | |
| Common | 14509842 | 10.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 20722667 | |
| I | 16737585 | |
| O | 13132204 | |
| C | 11618370 | |
| E | 10597696 | |
| L | 10580921 | |
| N | 10564218 | |
| R | 6619988 | 5.1% |
| P | 6549372 | 5.0% |
| T | 5119197 | 3.9% |
| Other values (7) | 18188195 |
Common
| Value | Count | Frequency (%) |
| 11635127 | ||
| / | 2874715 | 19.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 144940255 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 20722667 | |
| I | 16737585 | |
| O | 13132204 | |
| 11635127 | ||
| C | 11618370 | |
| E | 10597696 | |
| L | 10580921 | |
| N | 10564218 | |
| R | 6619988 | 4.6% |
| P | 6549372 | 4.5% |
| Other values (9) | 26182107 |
CAE_ESTADO
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Memory size | 50.2 MiB |
| 1.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 19675182 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 6558394 | |
| (Missing) | 17562 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 6558394 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 6558394 | |
| . | 6558394 | |
| 0 | 6558394 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 13116788 | |
| Other Punctuation | 6558394 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 6558394 | |
| 0 | 6558394 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6558394 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 19675182 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 6558394 | |
| . | 6558394 | |
| 0 | 6558394 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19675182 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 6558394 | |
| . | 6558394 | |
| 0 | 6558394 |
CAE_NOTA_POSTULA
Real number (ℝ)
| Distinct | 477 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 759.91985 |
| Minimum | 400 |
|---|---|
| Maximum | 1000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 50.2 MiB |
Quantile statistics
| Minimum | 400 |
|---|---|
| 5-th percentile | 661 |
| Q1 | 715 |
| median | 755 |
| Q3 | 801 |
| 95-th percentile | 877 |
| Maximum | 1000 |
| Range | 600 |
| Interquartile range (IQR) | 86 |
Descriptive statistics
| Standard deviation | 65.120012 |
|---|---|
| Coefficient of variation (CV) | 0.085693264 |
| Kurtosis | 0.10422777 |
| Mean | 759.91985 |
| Median Absolute Deviation (MAD) | 43 |
| Skewness | 0.36177401 |
| Sum | 4.9838538 × 109 |
| Variance | 4240.616 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 742 | 43316 | 0.7% |
| 750 | 43031 | 0.7% |
| 747 | 42994 | 0.7% |
| 733 | 42922 | 0.7% |
| 741 | 42803 | 0.7% |
| 735 | 42661 | 0.6% |
| 753 | 42661 | 0.6% |
| 743 | 42598 | 0.6% |
| 746 | 42550 | 0.6% |
| 745 | 42542 | 0.6% |
| Other values (467) | 6130316 |
| Value | Count | Frequency (%) |
| 400 | 5 | < 0.1% |
| 466 | 5 | < 0.1% |
| 478 | 5 | < 0.1% |
| 487 | 5 | < 0.1% |
| 499 | 4 | < 0.1% |
| 518 | 7 | < 0.1% |
| 520 | 23 | |
| 525 | 10 | |
| 529 | 15 | |
| 530 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 1000 | 215 | |
| 999 | 35 | < 0.1% |
| 998 | 37 | < 0.1% |
| 997 | 44 | < 0.1% |
| 996 | 70 | < 0.1% |
| 995 | 85 | < 0.1% |
| 994 | 69 | < 0.1% |
| 993 | 119 | |
| 992 | 107 | |
| 991 | 174 |
POS_ID
Real number (ℝ)
| Distinct | 6575921 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23008348 |
| Minimum | 16425446 |
|---|---|
| Maximum | 29581641 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 50.2 MiB |
Quantile statistics
| Minimum | 16425446 |
|---|---|
| 5-th percentile | 17083586 |
| Q1 | 19726792 |
| median | 23011160 |
| Q3 | 26295474 |
| 95-th percentile | 28924030 |
| Maximum | 29581641 |
| Range | 13156195 |
| Interquartile range (IQR) | 6568683 |
Descriptive statistics
| Standard deviation | 3796002.2 |
|---|---|
| Coefficient of variation (CV) | 0.16498369 |
| Kurtosis | -1.1980221 |
| Mean | 23008348 |
| Median Absolute Deviation (MAD) | 3284341.5 |
| Skewness | -0.0019412153 |
| Sum | 1.5130188 × 1014 |
| Variance | 1.4409632 × 1013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24000000 | 8 | < 0.1% |
| 20000000 | 6 | < 0.1% |
| 21000000 | 6 | < 0.1% |
| 28000000 | 6 | < 0.1% |
| 27000000 | 5 | < 0.1% |
| 25000000 | 5 | < 0.1% |
| 19000000 | 4 | < 0.1% |
| 26000000 | 3 | < 0.1% |
| 29103938 | 1 | < 0.1% |
| 24198419 | 1 | < 0.1% |
| Other values (6575911) | 6575911 |
| Value | Count | Frequency (%) |
| 16425446 | 1 | |
| 16425447 | 1 | |
| 16425448 | 1 | |
| 16425449 | 1 | |
| 16425450 | 1 | |
| 16425453 | 1 | |
| 16425454 | 1 | |
| 16425460 | 1 | |
| 16425461 | 1 | |
| 16425462 | 1 |
| Value | Count | Frequency (%) |
| 29581641 | 1 | |
| 29581640 | 1 | |
| 29581631 | 1 | |
| 29581630 | 1 | |
| 29581629 | 1 | |
| 29581625 | 1 | |
| 29581624 | 1 | |
| 29581621 | 1 | |
| 29581620 | 1 | |
| 29581619 | 1 |
POS_FECHA_POSTULACION
Categorical
| Distinct | 601550 |
|---|---|
| Distinct (%) | 9.2% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Memory size | 50.2 MiB |
| 08/08/2019 | 312910 |
|---|---|
| 19/11/2020 | 124783 |
| 09/08/2019 | 117099 |
| 10/08/2019 | 87169 |
| 11/08/2019 | 68004 |
| Other values (601545) |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 16.291171 |
| Min length | 10 |
Characters and Unicode
| Total characters | 106843918 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 29536 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | 27/10/2021 15:27 |
|---|---|
| 2nd row | 27/10/2021 15:27 |
| 3rd row | 27/10/2021 15:27 |
| 4th row | 27/10/2021 15:27 |
| 5th row | 27/10/2021 15:27 |
Common Values
| Value | Count | Frequency (%) |
| 08/08/2019 | 312910 | 4.8% |
| 19/11/2020 | 124783 | 1.9% |
| 09/08/2019 | 117099 | 1.8% |
| 10/08/2019 | 87169 | 1.3% |
| 11/08/2019 | 68004 | 1.0% |
| 20/11/2020 | 36435 | 0.6% |
| 23/10/2020 23:26 | 1595 | < 0.1% |
| 23/10/2020 23:30 | 1471 | < 0.1% |
| 23/10/2020 20:49 | 1401 | < 0.1% |
| 23/10/2020 21:03 | 1383 | < 0.1% |
| Other values (601540) | 5806144 | |
| (Missing) | 17562 | 0.3% |
Length
| Value | Count | Frequency (%) |
| 19/4/2021 | 609459 | 4.9% |
| 29/9/2021 | 551833 | 4.5% |
| 13/3/2020 | 509857 | 4.1% |
| 24/10/2020 | 468356 | 3.8% |
| 19/5/2020 | 323269 | 2.6% |
| 08/08/2019 | 312910 | 2.5% |
| 02/05/2021 | 297627 | 2.4% |
| 14/10/2021 | 286534 | 2.3% |
| 6/11/2020 | 271124 | 2.2% |
| 6/6/2020 | 184581 | 1.5% |
| Other values (86457) | 8554838 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 18816626 | |
| 0 | 16370052 | |
| 1 | 16228563 | |
| / | 13116788 | |
| : | 9919166 | |
| 5811994 | 5.4% | |
| 9 | 5462684 | 5.1% |
| 3 | 5155482 | 4.8% |
| 4 | 4738674 | 4.4% |
| 5 | 4361762 | 4.1% |
| Other values (3) | 6862127 | 6.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 77995970 | |
| Other Punctuation | 23035954 | 21.6% |
| Space Separator | 5811994 | 5.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 18816626 | |
| 0 | 16370052 | |
| 1 | 16228563 | |
| 9 | 5462684 | 7.0% |
| 3 | 5155482 | 6.6% |
| 4 | 4738674 | 6.1% |
| 5 | 4361762 | 5.6% |
| 8 | 2844314 | 3.6% |
| 6 | 2234719 | 2.9% |
| 7 | 1783094 | 2.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 13116788 | |
| : | 9919166 |
Space Separator
| Value | Count | Frequency (%) |
| 5811994 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 106843918 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 18816626 | |
| 0 | 16370052 | |
| 1 | 16228563 | |
| / | 13116788 | |
| : | 9919166 | |
| 5811994 | 5.4% | |
| 9 | 5462684 | 5.1% |
| 3 | 5155482 | 4.8% |
| 4 | 4738674 | 4.4% |
| 5 | 4361762 | 4.1% |
| Other values (3) | 6862127 | 6.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 106843918 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 18816626 | |
| 0 | 16370052 | |
| 1 | 16228563 | |
| / | 13116788 | |
| : | 9919166 | |
| 5811994 | 5.4% | |
| 9 | 5462684 | 5.1% |
| 3 | 5155482 | 4.8% |
| 4 | 4738674 | 4.4% |
| 5 | 4361762 | 4.1% |
| Other values (3) | 6862127 | 6.4% |
CUS_ID
Real number (ℝ)
| Distinct | 25164 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 290284.95 |
| Minimum | 267051 |
|---|---|
| Maximum | 313609 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 50.2 MiB |
Quantile statistics
| Minimum | 267051 |
|---|---|
| 5-th percentile | 267979 |
| Q1 | 277658 |
| median | 290603 |
| Q3 | 303116 |
| 95-th percentile | 311712 |
| Maximum | 313609 |
| Range | 46558 |
| Interquartile range (IQR) | 25458 |
Descriptive statistics
| Standard deviation | 13340.978 |
|---|---|
| Coefficient of variation (CV) | 0.045958215 |
| Kurtosis | -1.1164819 |
| Mean | 290284.95 |
| Median Absolute Deviation (MAD) | 12935 |
| Skewness | -0.053459198 |
| Sum | 1.9038031 × 1012 |
| Variance | 1.7798169 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 304787 | 9436 | 0.1% |
| 304597 | 9257 | 0.1% |
| 304825 | 8232 | 0.1% |
| 304791 | 7545 | 0.1% |
| 294999 | 6951 | 0.1% |
| 295018 | 6836 | 0.1% |
| 304779 | 6301 | 0.1% |
| 277665 | 5921 | 0.1% |
| 277397 | 5761 | 0.1% |
| 286432 | 5753 | 0.1% |
| Other values (25154) | 6486401 | |
| (Missing) | 17562 | 0.3% |
| Value | Count | Frequency (%) |
| 267051 | 633 | |
| 267052 | 386 | |
| 267053 | 380 | |
| 267054 | 572 | |
| 267055 | 248 | < 0.1% |
| 267056 | 230 | < 0.1% |
| 267057 | 314 | |
| 267058 | 66 | < 0.1% |
| 267059 | 63 | < 0.1% |
| 267060 | 23 | < 0.1% |
| Value | Count | Frequency (%) |
| 313609 | 17 | < 0.1% |
| 313608 | 2 | < 0.1% |
| 313607 | 2 | < 0.1% |
| 313606 | 15 | < 0.1% |
| 313605 | 10 | < 0.1% |
| 313604 | 27 | < 0.1% |
| 313603 | 794 | |
| 313602 | 4 | < 0.1% |
| 313601 | 1 | < 0.1% |
| 313600 | 86 | < 0.1% |
NOTA_POSTULA
Real number (ℝ)
| Distinct | 525 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 759.48521 |
| Minimum | 392 |
|---|---|
| Maximum | 1000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 50.2 MiB |
Quantile statistics
| Minimum | 392 |
|---|---|
| 5-th percentile | 660 |
| Q1 | 715 |
| median | 754 |
| Q3 | 801 |
| 95-th percentile | 877 |
| Maximum | 1000 |
| Range | 608 |
| Interquartile range (IQR) | 86 |
Descriptive statistics
| Standard deviation | 65.247155 |
|---|---|
| Coefficient of variation (CV) | 0.085909711 |
| Kurtosis | 0.11734681 |
| Mean | 759.48521 |
| Median Absolute Deviation (MAD) | 43 |
| Skewness | 0.34512059 |
| Sum | 4.9810032 × 109 |
| Variance | 4257.1912 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 742 | 43303 | 0.7% |
| 750 | 42987 | 0.7% |
| 733 | 42878 | 0.7% |
| 747 | 42856 | 0.7% |
| 741 | 42742 | 0.6% |
| 735 | 42585 | 0.6% |
| 753 | 42531 | 0.6% |
| 743 | 42475 | 0.6% |
| 746 | 42445 | 0.6% |
| 745 | 42396 | 0.6% |
| Other values (515) | 6131196 |
| Value | Count | Frequency (%) |
| 392 | 2 | < 0.1% |
| 400 | 5 | |
| 433 | 1 | < 0.1% |
| 437 | 1 | < 0.1% |
| 439 | 1 | < 0.1% |
| 448 | 1 | < 0.1% |
| 453 | 2 | < 0.1% |
| 462 | 1 | < 0.1% |
| 464 | 3 | < 0.1% |
| 466 | 8 |
| Value | Count | Frequency (%) |
| 1000 | 215 | |
| 999 | 35 | < 0.1% |
| 998 | 37 | < 0.1% |
| 997 | 39 | < 0.1% |
| 996 | 70 | < 0.1% |
| 995 | 79 | < 0.1% |
| 994 | 67 | < 0.1% |
| 993 | 114 | |
| 992 | 107 | |
| 991 | 173 |
PRD_ID_NUM_POSTULACION
Real number (ℝ)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 479.18149 |
| Minimum | 446 |
|---|---|
| Maximum | 1092 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 50.2 MiB |
Quantile statistics
| Minimum | 446 |
|---|---|
| 5-th percentile | 446 |
| Q1 | 446 |
| median | 446 |
| Q3 | 492 |
| 95-th percentile | 494 |
| Maximum | 1092 |
| Range | 646 |
| Interquartile range (IQR) | 46 |
Descriptive statistics
| Standard deviation | 89.484575 |
|---|---|
| Coefficient of variation (CV) | 0.18674464 |
| Kurtosis | 38.53175 |
| Mean | 479.18149 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.1183298 |
| Sum | 3.1510764 × 109 |
| Variance | 8007.4891 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 446 | 3597362 | |
| 492 | 1802730 | |
| 494 | 1037891 | 15.8% |
| 1092 | 120411 | 1.8% |
| 908 | 14819 | 0.2% |
| 808 | 2058 | < 0.1% |
| 561 | 685 | < 0.1% |
| Value | Count | Frequency (%) |
| 446 | 3597362 | |
| 492 | 1802730 | |
| 494 | 1037891 | 15.8% |
| 561 | 685 | < 0.1% |
| 808 | 2058 | < 0.1% |
| 908 | 14819 | 0.2% |
| 1092 | 120411 | 1.8% |
| Value | Count | Frequency (%) |
| 1092 | 120411 | 1.8% |
| 908 | 14819 | 0.2% |
| 808 | 2058 | < 0.1% |
| 561 | 685 | < 0.1% |
| 494 | 1037891 | 15.8% |
| 492 | 1802730 | |
| 446 | 3597362 |
POS_PRIORIDAD
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 50.2 MiB |
| 1 | |
|---|---|
| 2 | |
| 3 | |
| 4 | |
| 5 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 6575956 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 5 |
| 3rd row | 3 |
| 4th row | 1 |
| 5th row | 4 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 1573514 | |
| 2 | 1449702 | |
| 3 | 1344376 | |
| 4 | 1177104 | |
| 5 | 1031260 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 1573514 | |
| 2 | 1449702 | |
| 3 | 1344376 | |
| 4 | 1177104 | |
| 5 | 1031260 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1573514 | |
| 2 | 1449702 | |
| 3 | 1344376 | |
| 4 | 1177104 | |
| 5 | 1031260 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6575956 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1573514 | |
| 2 | 1449702 | |
| 3 | 1344376 | |
| 4 | 1177104 | |
| 5 | 1031260 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6575956 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1573514 | |
| 2 | 1449702 | |
| 3 | 1344376 | |
| 4 | 1177104 | |
| 5 | 1031260 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6575956 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1573514 | |
| 2 | 1449702 | |
| 3 | 1344376 | |
| 4 | 1177104 | |
| 5 | 1031260 |
POS_ESTADO
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Memory size | 50.2 MiB |
| 1.0 | |
|---|---|
| 0.0 | 180 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 19675182 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 6558214 | |
| 0.0 | 180 | < 0.1% |
| (Missing) | 17562 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 6558214 | |
| 0.0 | 180 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6558574 | |
| . | 6558394 | |
| 1 | 6558214 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 13116788 | |
| Other Punctuation | 6558394 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6558574 | |
| 1 | 6558214 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6558394 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 19675182 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 6558574 | |
| . | 6558394 | |
| 1 | 6558214 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19675182 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 6558574 | |
| . | 6558394 | |
| 1 | 6558214 |
IES_ID
Real number (ℝ)
| Distinct | 250 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 109.83078 |
| Minimum | 22 |
|---|---|
| Maximum | 1054 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 50.2 MiB |
Quantile statistics
| Minimum | 22 |
|---|---|
| 5-th percentile | 23 |
| Q1 | 48 |
| median | 59 |
| Q3 | 85 |
| 95-th percentile | 513 |
| Maximum | 1054 |
| Range | 1032 |
| Interquartile range (IQR) | 37 |
Descriptive statistics
| Standard deviation | 177.00474 |
|---|---|
| Coefficient of variation (CV) | 1.6116133 |
| Kurtosis | 13.167232 |
| Mean | 109.83078 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 3.6550662 |
| Sum | 7.2224235 × 108 |
| Variance | 31330.678 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 51 | 1274395 | |
| 46 | 858900 | |
| 59 | 581336 | 8.8% |
| 86 | 421493 | 6.4% |
| 22 | 313590 | 4.8% |
| 72 | 253365 | 3.9% |
| 48 | 240948 | 3.7% |
| 102 | 218339 | 3.3% |
| 88 | 206074 | 3.1% |
| 85 | 188422 | 2.9% |
| Other values (240) | 2019094 |
| Value | Count | Frequency (%) |
| 22 | 313590 | |
| 23 | 29410 | 0.4% |
| 29 | 113738 | 1.7% |
| 30 | 17047 | 0.3% |
| 31 | 134715 | |
| 32 | 44250 | 0.7% |
| 38 | 6610 | 0.1% |
| 39 | 53468 | 0.8% |
| 43 | 347 | < 0.1% |
| 44 | 5966 | 0.1% |
| Value | Count | Frequency (%) |
| 1054 | 132 | < 0.1% |
| 1053 | 117 | < 0.1% |
| 1051 | 527 | < 0.1% |
| 1050 | 129 | < 0.1% |
| 1049 | 13 | < 0.1% |
| 1047 | 681 | < 0.1% |
| 1046 | 6450 | 0.1% |
| 1045 | 411 | < 0.1% |
| 1040 | 10912 | |
| 1034 | 17938 |
IES_NOMBRE_INSTIT
Categorical
| Distinct | 245 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Memory size | 50.2 MiB |
| UNIVERSIDAD DE GUAYAQUIL | |
|---|---|
| UNIVERSIDAD CENTRAL DEL ECUADOR | |
| UNIVERSIDAD ESTATAL DE MILAGRO | |
| UNIVERSIDAD TECNICA DE MANABI | |
| UNIVERSIDAD DE LAS FUERZAS ARMADAS (ESPE) | |
| Other values (240) |
Length
| Max length | 82 |
|---|---|
| Median length | 76 |
| Mean length | 31.835111 |
| Min length | 15 |
Characters and Unicode
| Total characters | 208787201 |
|---|---|
| Distinct characters | 44 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | UNIVERSIDAD CENTRAL DEL ECUADOR |
|---|---|
| 2nd row | UNIVERSIDAD ESTATAL DE MILAGRO |
| 3rd row | UNIVERSIDAD NACIONAL DE LOJA |
| 4th row | UNIVERSIDAD NACIONAL DE LOJA |
| 5th row | UNIVERSIDAD NACIONAL DE LOJA |
Common Values
| Value | Count | Frequency (%) |
| UNIVERSIDAD DE GUAYAQUIL | 1274395 | |
| UNIVERSIDAD CENTRAL DEL ECUADOR | 858889 | |
| UNIVERSIDAD ESTATAL DE MILAGRO | 578566 | 8.8% |
| UNIVERSIDAD TECNICA DE MANABI | 421269 | 6.4% |
| UNIVERSIDAD DE LAS FUERZAS ARMADAS (ESPE) | 312102 | 4.7% |
| UNIVERSIDAD NACIONAL DE LOJA | 253365 | 3.9% |
| UNIVERSIDAD DE CUENCA | 240948 | 3.7% |
| UNIVERSIDAD LAICA ELOY ALFARO DE MANABI | 218151 | 3.3% |
| UNIVERSIDAD TECNICA DEL NORTE | 206074 | 3.1% |
| UNIVERSIDAD TECNICA DE MACHALA | 188422 | 2.9% |
| Other values (235) | 2006213 |
Length
| Value | Count | Frequency (%) |
| universidad | 5714024 | |
| de | 4739258 | |
| tecnica | 1419808 | 5.2% |
| guayaquil | 1338198 | 4.9% |
| del | 1289113 | 4.7% |
| estatal | 966292 | 3.5% |
| ecuador | 930261 | 3.4% |
| central | 888299 | 3.3% |
| superior | 730619 | 2.7% |
| manabi | 726107 | 2.7% |
| Other values (373) | 8499184 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 23937039 | |
| E | 21304068 | |
| I | 20733349 | |
| 20683724 | ||
| D | 18987441 | |
| N | 12656324 | 6.1% |
| U | 12199504 | 5.8% |
| R | 12037258 | 5.8% |
| S | 10587440 | 5.1% |
| L | 9122560 | 4.4% |
| Other values (34) | 46538494 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 187414670 | |
| Space Separator | 20683724 | 9.9% |
| Open Punctuation | 315219 | 0.2% |
| Close Punctuation | 315219 | 0.2% |
| Modifier Symbol | 22951 | < 0.1% |
| Decimal Number | 14916 | < 0.1% |
| Dash Punctuation | 7254 | < 0.1% |
| Lowercase Letter | 7011 | < 0.1% |
| Other Punctuation | 6237 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 23937039 | |
| E | 21304068 | |
| I | 20733349 | |
| D | 18987441 | |
| N | 12656324 | 6.8% |
| U | 12199504 | 6.5% |
| R | 12037258 | 6.4% |
| S | 10587440 | 5.6% |
| L | 9122560 | 4.9% |
| C | 9107142 | 4.9% |
| Other values (22) | 36742545 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 7350 | |
| 7 | 7350 | |
| 0 | 162 | 1.1% |
| 2 | 54 | 0.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5870 | |
| ' | 367 | 5.9% |
Space Separator
| Value | Count | Frequency (%) |
| 20683724 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 315219 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 315219 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 22951 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7254 |
Lowercase Letter
| Value | Count | Frequency (%) |
| ü | 7011 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 187421681 | |
| Common | 21365520 | 10.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 23937039 | |
| E | 21304068 | |
| I | 20733349 | |
| D | 18987441 | |
| N | 12656324 | 6.8% |
| U | 12199504 | 6.5% |
| R | 12037258 | 6.4% |
| S | 10587440 | 5.6% |
| L | 9122560 | 4.9% |
| C | 9107142 | 4.9% |
| Other values (23) | 36749556 |
Common
| Value | Count | Frequency (%) |
| 20683724 | ||
| ( | 315219 | 1.5% |
| ) | 315219 | 1.5% |
| ´ | 22951 | 0.1% |
| 1 | 7350 | < 0.1% |
| 7 | 7350 | < 0.1% |
| - | 7254 | < 0.1% |
| . | 5870 | < 0.1% |
| ' | 367 | < 0.1% |
| 0 | 162 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 208003981 | |
| None | 783220 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 23937039 | |
| E | 21304068 | |
| I | 20733349 | |
| 20683724 | ||
| D | 18987441 | |
| N | 12656324 | 6.1% |
| U | 12199504 | 5.9% |
| R | 12037258 | 5.8% |
| S | 10587440 | 5.1% |
| L | 9122560 | 4.4% |
| Other values (26) | 45755274 |
None
| Value | Count | Frequency (%) |
| Ó | 603049 | |
| Í | 89625 | 11.4% |
| É | 39043 | 5.0% |
| ´ | 22951 | 2.9% |
| Ñ | 14937 | 1.9% |
| ü | 7011 | 0.9% |
| Á | 4048 | 0.5% |
| Ú | 2556 | 0.3% |
IES_TIPO_IES
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Memory size | 50.2 MiB |
| U | |
|---|---|
| I | 537001 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 6558394 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | U |
|---|---|
| 2nd row | U |
| 3rd row | U |
| 4th row | U |
| 5th row | U |
Common Values
| Value | Count | Frequency (%) |
| U | 6021393 | |
| I | 537001 | 8.2% |
| (Missing) | 17562 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| u | 6021393 | |
| i | 537001 | 8.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 6021393 | |
| I | 537001 | 8.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6558394 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 6021393 | |
| I | 537001 | 8.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6558394 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 6021393 | |
| I | 537001 | 8.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6558394 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 6021393 | |
| I | 537001 | 8.2% |
IES_TIPO_FINANCIAMIENTO
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Memory size | 50.2 MiB |
| PÚBLICA | |
|---|---|
| COFINANCIADA | 68817 |
| AUTOFINANCIADA | 68203 |
Length
| Max length | 14 |
|---|---|
| Median length | 7 |
| Mean length | 7.1252602 |
| Min length | 7 |
Characters and Unicode
| Total characters | 46730264 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PÚBLICA |
|---|---|
| 2nd row | PÚBLICA |
| 3rd row | PÚBLICA |
| 4th row | PÚBLICA |
| 5th row | PÚBLICA |
Common Values
| Value | Count | Frequency (%) |
| PÚBLICA | 6421374 | |
| COFINANCIADA | 68817 | 1.0% |
| AUTOFINANCIADA | 68203 | 1.0% |
| (Missing) | 17562 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| pública | 6421374 | |
| cofinanciada | 68817 | 1.0% |
| autofinanciada | 68203 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 6900637 | |
| I | 6695414 | |
| C | 6627211 | |
| P | 6421374 | |
| Ú | 6421374 | |
| B | 6421374 | |
| L | 6421374 | |
| N | 274040 | 0.6% |
| O | 137020 | 0.3% |
| F | 137020 | 0.3% |
| Other values (3) | 273426 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 46730264 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 6900637 | |
| I | 6695414 | |
| C | 6627211 | |
| P | 6421374 | |
| Ú | 6421374 | |
| B | 6421374 | |
| L | 6421374 | |
| N | 274040 | 0.6% |
| O | 137020 | 0.3% |
| F | 137020 | 0.3% |
| Other values (3) | 273426 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 46730264 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 6900637 | |
| I | 6695414 | |
| C | 6627211 | |
| P | 6421374 | |
| Ú | 6421374 | |
| B | 6421374 | |
| L | 6421374 | |
| N | 274040 | 0.6% |
| O | 137020 | 0.3% |
| F | 137020 | 0.3% |
| Other values (3) | 273426 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40308890 | |
| None | 6421374 | 13.7% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 6900637 | |
| I | 6695414 | |
| C | 6627211 | |
| P | 6421374 | |
| B | 6421374 | |
| L | 6421374 | |
| N | 274040 | 0.7% |
| O | 137020 | 0.3% |
| F | 137020 | 0.3% |
| D | 137020 | 0.3% |
| Other values (2) | 136406 | 0.3% |
None
| Value | Count | Frequency (%) |
| Ú | 6421374 |
OFA_ID
Real number (ℝ)
| Distinct | 11597 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 152032.9 |
| Minimum | 93025 |
|---|---|
| Maximum | 183438 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 50.2 MiB |
Quantile statistics
| Minimum | 93025 |
|---|---|
| 5-th percentile | 96115 |
| Q1 | 144010 |
| median | 158819 |
| Q3 | 171929 |
| 95-th percentile | 180709 |
| Maximum | 183438 |
| Range | 90413 |
| Interquartile range (IQR) | 27919 |
Descriptive statistics
| Standard deviation | 26147.786 |
|---|---|
| Coefficient of variation (CV) | 0.17198768 |
| Kurtosis | 0.14055579 |
| Mean | 152032.9 |
| Median Absolute Deviation (MAD) | 13614 |
| Skewness | -1.1080245 |
| Sum | 9.9976165 × 1011 |
| Variance | 6.837067 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 176762 | 18148 | 0.3% |
| 180856 | 17440 | 0.3% |
| 167181 | 15361 | 0.2% |
| 148309 | 14760 | 0.2% |
| 179253 | 14689 | 0.2% |
| 180490 | 14551 | 0.2% |
| 182232 | 14521 | 0.2% |
| 174129 | 14287 | 0.2% |
| 167183 | 12879 | 0.2% |
| 172328 | 12527 | 0.2% |
| Other values (11587) | 6426793 |
| Value | Count | Frequency (%) |
| 93025 | 1455 | < 0.1% |
| 93028 | 5748 | |
| 93029 | 752 | < 0.1% |
| 93034 | 442 | < 0.1% |
| 93037 | 2103 | < 0.1% |
| 93040 | 1303 | < 0.1% |
| 93041 | 2277 | < 0.1% |
| 93047 | 1424 | < 0.1% |
| 93051 | 895 | < 0.1% |
| 93054 | 3099 |
| Value | Count | Frequency (%) |
| 183438 | 435 | < 0.1% |
| 183436 | 38 | < 0.1% |
| 183433 | 369 | < 0.1% |
| 183432 | 333 | < 0.1% |
| 183429 | 183 | < 0.1% |
| 183425 | 530 | < 0.1% |
| 183424 | 535 | < 0.1% |
| 183408 | 548 | < 0.1% |
| 183402 | 158 | < 0.1% |
| 183395 | 2718 |
IES_ESTADO
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Memory size | 50.2 MiB |
| A | |
|---|---|
| I | 487 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 6558394 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | A |
|---|---|
| 2nd row | A |
| 3rd row | A |
| 4th row | A |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| A | 6557907 | |
| I | 487 | < 0.1% |
| (Missing) | 17562 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| a | 6557907 | |
| i | 487 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 6557907 | |
| I | 487 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6558394 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 6557907 | |
| I | 487 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6558394 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 6557907 | |
| I | 487 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6558394 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 6557907 | |
| I | 487 | < 0.1% |
APC_ID
Real number (ℝ)
| Distinct | 8720 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40587.797 |
| Minimum | 27861 |
|---|---|
| Maximum | 48518 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 50.2 MiB |
Quantile statistics
| Minimum | 27861 |
|---|---|
| 5-th percentile | 28648 |
| Q1 | 37848 |
| median | 41681 |
| Q3 | 45335 |
| 95-th percentile | 47747 |
| Maximum | 48518 |
| Range | 20657 |
| Interquartile range (IQR) | 7487 |
Descriptive statistics
| Standard deviation | 5870.914 |
|---|---|
| Coefficient of variation (CV) | 0.14464727 |
| Kurtosis | -0.32792615 |
| Mean | 40587.797 |
| Median Absolute Deviation (MAD) | 3785 |
| Skewness | -0.8157006 |
| Sum | 2.6619076 × 1011 |
| Variance | 34467631 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 38987 | 21699 | 0.3% |
| 46270 | 19960 | 0.3% |
| 43645 | 19129 | 0.3% |
| 45466 | 19005 | 0.3% |
| 45103 | 18801 | 0.3% |
| 46269 | 18400 | 0.3% |
| 36736 | 18239 | 0.3% |
| 47778 | 18148 | 0.3% |
| 43996 | 16976 | 0.3% |
| 38524 | 16519 | 0.3% |
| Other values (8710) | 6371518 | |
| (Missing) | 17562 | 0.3% |
| Value | Count | Frequency (%) |
| 27861 | 2258 | |
| 27862 | 1161 | |
| 27863 | 700 | < 0.1% |
| 27864 | 1495 | |
| 27865 | 1292 | |
| 27866 | 1624 | |
| 27867 | 2277 | |
| 27868 | 2319 | |
| 27871 | 1233 | |
| 27872 | 480 | < 0.1% |
| Value | Count | Frequency (%) |
| 48518 | 243 | < 0.1% |
| 48517 | 1078 | < 0.1% |
| 48516 | 81 | < 0.1% |
| 48512 | 435 | < 0.1% |
| 48511 | 2718 | |
| 48510 | 12 | < 0.1% |
| 48509 | 179 | < 0.1% |
| 48508 | 571 | < 0.1% |
| 48507 | 216 | < 0.1% |
| 48506 | 493 | < 0.1% |
CCP_ID
Real number (ℝ)
| Distinct | 11597 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25891.568 |
| Minimum | 19105 |
|---|---|
| Maximum | 32288 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 50.2 MiB |
Quantile statistics
| Minimum | 19105 |
|---|---|
| 5-th percentile | 19979 |
| Q1 | 22417 |
| median | 26042 |
| Q3 | 29095 |
| 95-th percentile | 31858 |
| Maximum | 32288 |
| Range | 13183 |
| Interquartile range (IQR) | 6678 |
Descriptive statistics
| Standard deviation | 3762.3523 |
|---|---|
| Coefficient of variation (CV) | 0.14531187 |
| Kurtosis | -1.1894783 |
| Mean | 25891.568 |
| Median Absolute Deviation (MAD) | 3422 |
| Skewness | -0.033997259 |
| Sum | 1.7026182 × 1011 |
| Variance | 14155295 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 30197 | 18148 | 0.3% |
| 30216 | 17440 | 0.3% |
| 29091 | 15361 | 0.2% |
| 22365 | 14760 | 0.2% |
| 29284 | 14689 | 0.2% |
| 32071 | 14551 | 0.2% |
| 30201 | 14521 | 0.2% |
| 30220 | 14287 | 0.2% |
| 29076 | 12879 | 0.2% |
| 32087 | 12527 | 0.2% |
| Other values (11587) | 6426793 |
| Value | Count | Frequency (%) |
| 19105 | 24 | |
| 19109 | 57 | |
| 19113 | 24 | |
| 19117 | 21 | < 0.1% |
| 19121 | 24 | |
| 19125 | 24 | |
| 19129 | 20 | < 0.1% |
| 19133 | 29 | |
| 19134 | 6 | < 0.1% |
| 19135 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 32288 | 381 | |
| 32268 | 69 | < 0.1% |
| 32256 | 548 | |
| 32255 | 530 | |
| 32254 | 69 | < 0.1% |
| 32253 | 57 | < 0.1% |
| 32252 | 85 | < 0.1% |
| 32251 | 131 | < 0.1% |
| 32250 | 201 | < 0.1% |
| 32249 | 233 |
CAR_ID
Real number (ℝ)
| Distinct | 420 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5310.1894 |
| Minimum | 4447 |
|---|---|
| Maximum | 7603 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 50.2 MiB |
Quantile statistics
| Minimum | 4447 |
|---|---|
| 5-th percentile | 4462 |
| Q1 | 4641 |
| median | 5043 |
| Q3 | 5457 |
| 95-th percentile | 7203 |
| Maximum | 7603 |
| Range | 3156 |
| Interquartile range (IQR) | 816 |
Descriptive statistics
| Standard deviation | 872.93091 |
|---|---|
| Coefficient of variation (CV) | 0.1643879 |
| Kurtosis | 0.54862205 |
| Mean | 5310.1894 |
| Median Absolute Deviation (MAD) | 414 |
| Skewness | 1.3053596 |
| Sum | 3.4919572 × 1010 |
| Variance | 762008.38 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5455 | 316362 | 4.8% |
| 4781 | 300470 | 4.6% |
| 5476 | 288861 | 4.4% |
| 4641 | 278625 | 4.2% |
| 4473 | 274270 | 4.2% |
| 4478 | 256306 | 3.9% |
| 5334 | 212909 | 3.2% |
| 5457 | 210782 | 3.2% |
| 4458 | 200350 | 3.0% |
| 4462 | 196591 | 3.0% |
| Other values (410) | 4040430 |
| Value | Count | Frequency (%) |
| 4447 | 593 | < 0.1% |
| 4458 | 200350 | |
| 4459 | 55800 | 0.8% |
| 4460 | 8359 | 0.1% |
| 4461 | 41599 | 0.6% |
| 4462 | 196591 | |
| 4470 | 127 | < 0.1% |
| 4473 | 274270 | |
| 4474 | 123605 | |
| 4478 | 256306 |
| Value | Count | Frequency (%) |
| 7603 | 346 | |
| 7602 | 38 | < 0.1% |
| 7601 | 74 | < 0.1% |
| 7600 | 243 | |
| 7598 | 2 | < 0.1% |
| 7597 | 19 | < 0.1% |
| 7596 | 7 | < 0.1% |
| 7595 | 18 | < 0.1% |
| 7593 | 422 | |
| 7587 | 7 | < 0.1% |
CAR_NOMBRE_CARRERA
Categorical
| Distinct | 410 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Memory size | 50.2 MiB |
| ADMINISTRACION DE EMPRESAS | 316857 |
|---|---|
| DERECHO | 303256 |
| PSICOLOGIA | 288861 |
| ENFERMERIA | 278776 |
| EDUCACION INICIAL | 273737 |
| Other values (405) |
Length
| Max length | 99 |
|---|---|
| Median length | 78 |
| Mean length | 18.970351 |
| Min length | 4 |
Characters and Unicode
| Total characters | 124415036 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ADMINISTRACION DE EMPRESAS |
|---|---|
| 2nd row | COMUNICACION |
| 3rd row | DERECHO |
| 4th row | COMUNICACION |
| 5th row | EDUCACION INICIAL |
Common Values
| Value | Count | Frequency (%) |
| ADMINISTRACION DE EMPRESAS | 316857 | 4.8% |
| DERECHO | 303256 | 4.6% |
| PSICOLOGIA | 288861 | 4.4% |
| ENFERMERIA | 278776 | 4.2% |
| EDUCACION INICIAL | 273737 | 4.2% |
| EDUCACION BASICA | 256102 | 3.9% |
| ECONOMIA | 212543 | 3.2% |
| TURISMO | 209349 | 3.2% |
| CONTABILIDAD Y AUDITORIA | 200125 | 3.0% |
| MEDICINA | 196591 | 3.0% |
| Other values (400) | 4022197 |
Length
| Value | Count | Frequency (%) |
| de | 1064811 | 7.0% |
| y | 796864 | 5.3% |
| educacion | 557321 | 3.7% |
| tecnologia | 539182 | 3.6% |
| superior | 538249 | 3.6% |
| en | 513685 | 3.4% |
| ingenieria | 446759 | 2.9% |
| pedagogia | 391950 | 2.6% |
| la | 387082 | 2.6% |
| administracion | 380075 | 2.5% |
| Other values (412) | 9531334 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 16422565 | |
| A | 14013504 | |
| E | 12035749 | |
| O | 10935687 | |
| C | 8619180 | 6.9% |
| 8589381 | 6.9% | |
| N | 8559911 | 6.9% |
| R | 7318067 | 5.9% |
| S | 5367373 | 4.3% |
| D | 5301485 | 4.3% |
| Other values (26) | 27252134 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 115820336 | |
| Space Separator | 8589381 | 6.9% |
| Other Punctuation | 5139 | < 0.1% |
| Dash Punctuation | 180 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 16422565 | |
| A | 14013504 | |
| E | 12035749 | |
| O | 10935687 | |
| C | 8619180 | 7.4% |
| N | 8559911 | 7.4% |
| R | 7318067 | 6.3% |
| S | 5367373 | 4.6% |
| D | 5301485 | 4.6% |
| T | 5261412 | 4.5% |
| Other values (23) | 21985403 |
Space Separator
| Value | Count | Frequency (%) |
| 8589381 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 5139 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 180 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 115820336 | |
| Common | 8594700 | 6.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 16422565 | |
| A | 14013504 | |
| E | 12035749 | |
| O | 10935687 | |
| C | 8619180 | 7.4% |
| N | 8559911 | 7.4% |
| R | 7318067 | 6.3% |
| S | 5367373 | 4.6% |
| D | 5301485 | 4.6% |
| T | 5261412 | 4.5% |
| Other values (23) | 21985403 |
Common
| Value | Count | Frequency (%) |
| 8589381 | ||
| , | 5139 | 0.1% |
| - | 180 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 124258350 | |
| None | 156686 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 16422565 | |
| A | 14013504 | |
| E | 12035749 | |
| O | 10935687 | |
| C | 8619180 | 6.9% |
| 8589381 | 6.9% | |
| N | 8559911 | 6.9% |
| R | 7318067 | 5.9% |
| S | 5367373 | 4.3% |
| D | 5301485 | 4.3% |
| Other values (19) | 27095448 |
None
| Value | Count | Frequency (%) |
| Ñ | 145058 | |
| Ó | 7694 | 4.9% |
| Í | 1953 | 1.2% |
| Ü | 1679 | 1.1% |
| É | 138 | 0.1% |
| Ú | 129 | 0.1% |
| Á | 35 | < 0.1% |
MODALIDAD_ID
Real number (ℝ)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 66.486144 |
| Minimum | 8 |
|---|---|
| Maximum | 940 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 50.2 MiB |
Quantile statistics
| Minimum | 8 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 9 |
| median | 9 |
| Q3 | 9 |
| 95-th percentile | 441 |
| Maximum | 940 |
| Range | 932 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 152.52656 |
|---|---|
| Coefficient of variation (CV) | 2.2941104 |
| Kurtosis | 5.5637358 |
| Mean | 66.486144 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.5037682 |
| Sum | 4.3604233 × 108 |
| Variance | 23264.35 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 5491921 | |
| 441 | 745794 | 11.3% |
| 8 | 143909 | 2.2% |
| 10 | 78000 | 1.2% |
| 442 | 74408 | 1.1% |
| 940 | 24362 | 0.4% |
| (Missing) | 17562 | 0.3% |
| Value | Count | Frequency (%) |
| 8 | 143909 | 2.2% |
| 9 | 5491921 | |
| 10 | 78000 | 1.2% |
| 441 | 745794 | 11.3% |
| 442 | 74408 | 1.1% |
| 940 | 24362 | 0.4% |
| Value | Count | Frequency (%) |
| 940 | 24362 | 0.4% |
| 442 | 74408 | 1.1% |
| 441 | 745794 | 11.3% |
| 10 | 78000 | 1.2% |
| 9 | 5491921 | |
| 8 | 143909 | 2.2% |
MODALIDAD
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 50.2 MiB |
| PRESENCIAL | |
|---|---|
| EN LINEA | |
| DISTANCIA | 144133 |
| SEMI-PRESENCIAL | 78453 |
| DUAL | 74663 |
Length
| Max length | 15 |
|---|---|
| Median length | 10 |
| Mean length | 9.7305029 |
| Min length | 4 |
Characters and Unicode
| Total characters | 63987359 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | DISTANCIA |
|---|---|
| 2nd row | EN LINEA |
| 3rd row | DISTANCIA |
| 4th row | DISTANCIA |
| 5th row | DISTANCIA |
Common Values
| Value | Count | Frequency (%) |
| PRESENCIAL | 5504828 | |
| EN LINEA | 749282 | 11.4% |
| DISTANCIA | 144133 | 2.2% |
| SEMI-PRESENCIAL | 78453 | 1.2% |
| DUAL | 74663 | 1.1% |
| HIBRIDA | 24597 | 0.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| presencial | 5504828 | |
| en | 749282 | 10.2% |
| linea | 749282 | 10.2% |
| distancia | 144133 | 2.0% |
| semi-presencial | 78453 | 1.1% |
| dual | 74663 | 1.0% |
| hibrida | 24597 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 12743579 | |
| N | 7225978 | |
| I | 6748476 | |
| A | 6720089 | |
| L | 6407226 | |
| S | 5805867 | |
| C | 5727414 | |
| R | 5607878 | |
| P | 5583281 | |
| 749282 | 1.2% | |
| Other values (7) | 668289 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 63159624 | |
| Space Separator | 749282 | 1.2% |
| Dash Punctuation | 78453 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 12743579 | |
| N | 7225978 | |
| I | 6748476 | |
| A | 6720089 | |
| L | 6407226 | |
| S | 5805867 | |
| C | 5727414 | |
| R | 5607878 | |
| P | 5583281 | |
| D | 243393 | 0.4% |
| Other values (5) | 346443 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 749282 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 78453 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 63159624 | |
| Common | 827735 | 1.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 12743579 | |
| N | 7225978 | |
| I | 6748476 | |
| A | 6720089 | |
| L | 6407226 | |
| S | 5805867 | |
| C | 5727414 | |
| R | 5607878 | |
| P | 5583281 | |
| D | 243393 | 0.4% |
| Other values (5) | 346443 | 0.5% |
Common
| Value | Count | Frequency (%) |
| 749282 | ||
| - | 78453 | 9.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 63987359 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 12743579 | |
| N | 7225978 | |
| I | 6748476 | |
| A | 6720089 | |
| L | 6407226 | |
| S | 5805867 | |
| C | 5727414 | |
| R | 5607878 | |
| P | 5583281 | |
| 749282 | 1.2% | |
| Other values (7) | 668289 | 1.0% |
JORNADA_ID
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Memory size | 50.2 MiB |
| 1.0 | |
|---|---|
| 2.0 | |
| 7.0 | |
| 3.0 | |
| 5.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 19675182 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3.0 |
|---|---|
| 2nd row | 3.0 |
| 3rd row | 3.0 |
| 4th row | 3.0 |
| 5th row | 3.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 1897593 | |
| 2.0 | 1714874 | |
| 7.0 | 1285804 | |
| 3.0 | 992065 | |
| 5.0 | 668058 | 10.2% |
| (Missing) | 17562 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 1897593 | |
| 2.0 | 1714874 | |
| 7.0 | 1285804 | |
| 3.0 | 992065 | |
| 5.0 | 668058 | 10.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 6558394 | |
| 0 | 6558394 | |
| 1 | 1897593 | 9.6% |
| 2 | 1714874 | 8.7% |
| 7 | 1285804 | 6.5% |
| 3 | 992065 | 5.0% |
| 5 | 668058 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 13116788 | |
| Other Punctuation | 6558394 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6558394 | |
| 1 | 1897593 | 14.5% |
| 2 | 1714874 | 13.1% |
| 7 | 1285804 | 9.8% |
| 3 | 992065 | 7.6% |
| 5 | 668058 | 5.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6558394 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 19675182 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 6558394 | |
| 0 | 6558394 | |
| 1 | 1897593 | 9.6% |
| 2 | 1714874 | 8.7% |
| 7 | 1285804 | 6.5% |
| 3 | 992065 | 5.0% |
| 5 | 668058 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19675182 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 6558394 | |
| 0 | 6558394 | |
| 1 | 1897593 | 9.6% |
| 2 | 1714874 | 8.7% |
| 7 | 1285804 | 6.5% |
| 3 | 992065 | 5.0% |
| 5 | 668058 | 3.4% |
JORNADA
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 50.2 MiB |
| INTENSIVA | |
|---|---|
| MATUTINA | |
| VESPERTINA | |
| NO APLICA JORNADA | |
| NOCTURNA |
Length
| Max length | 17 |
|---|---|
| Median length | 10 |
| Mean length | 10.044605 |
| Min length | 8 |
Characters and Unicode
| Total characters | 66052879 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NO APLICA JORNADA |
|---|---|
| 2nd row | NO APLICA JORNADA |
| 3rd row | NO APLICA JORNADA |
| 4th row | NO APLICA JORNADA |
| 5th row | NO APLICA JORNADA |
Common Values
| Value | Count | Frequency (%) |
| INTENSIVA | 1900676 | |
| MATUTINA | 1719108 | |
| VESPERTINA | 1288185 | |
| NO APLICA JORNADA | 996465 | |
| NOCTURNA | 671522 | 10.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| intensiva | 1900676 | |
| matutina | 1719108 | |
| vespertina | 1288185 | |
| no | 996465 | |
| aplica | 996465 | |
| jornada | 996465 | |
| nocturna | 671522 | 7.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 11284459 | |
| N | 10144619 | |
| I | 7805110 | |
| T | 7298599 | |
| E | 4477046 | 6.8% |
| S | 3188861 | 4.8% |
| V | 3188861 | 4.8% |
| R | 2956172 | 4.5% |
| O | 2664452 | 4.0% |
| U | 2390630 | 3.6% |
| Other values (7) | 10654070 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 64059949 | |
| Space Separator | 1992930 | 3.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 11284459 | |
| N | 10144619 | |
| I | 7805110 | |
| T | 7298599 | |
| E | 4477046 | 7.0% |
| S | 3188861 | 5.0% |
| V | 3188861 | 5.0% |
| R | 2956172 | 4.6% |
| O | 2664452 | 4.2% |
| U | 2390630 | 3.7% |
| Other values (6) | 8661140 |
Space Separator
| Value | Count | Frequency (%) |
| 1992930 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 64059949 | |
| Common | 1992930 | 3.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 11284459 | |
| N | 10144619 | |
| I | 7805110 | |
| T | 7298599 | |
| E | 4477046 | 7.0% |
| S | 3188861 | 5.0% |
| V | 3188861 | 5.0% |
| R | 2956172 | 4.6% |
| O | 2664452 | 4.2% |
| U | 2390630 | 3.7% |
| Other values (6) | 8661140 |
Common
| Value | Count | Frequency (%) |
| 1992930 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 66052879 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 11284459 | |
| N | 10144619 | |
| I | 7805110 | |
| T | 7298599 | |
| E | 4477046 | 6.8% |
| S | 3188861 | 4.8% |
| V | 3188861 | 4.8% |
| R | 2956172 | 4.5% |
| O | 2664452 | 4.0% |
| U | 2390630 | 3.6% |
| Other values (7) | 10654070 |
NIVEL
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 50.2 MiB |
| TERCER NIVEL | |
|---|---|
| TERCER NIVEL TECNOLÓGICO SUPERIOR | 445771 |
| TECNOLOGICO SUPERIOR | 108628 |
| TERCER NIVEL TÉCNICO SUPERIOR | 15827 |
| TECNICO SUPERIOR | 2799 |
Length
| Max length | 33 |
|---|---|
| Median length | 12 |
| Mean length | 13.598318 |
| Min length | 12 |
Characters and Unicode
| Total characters | 89421942 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | TERCER NIVEL |
|---|---|
| 2nd row | TERCER NIVEL |
| 3rd row | TERCER NIVEL |
| 4th row | TERCER NIVEL |
| 5th row | TERCER NIVEL |
Common Values
| Value | Count | Frequency (%) |
| TERCER NIVEL | 6002931 | |
| TERCER NIVEL TECNOLÓGICO SUPERIOR | 445771 | 6.8% |
| TECNOLOGICO SUPERIOR | 108628 | 1.7% |
| TERCER NIVEL TÉCNICO SUPERIOR | 15827 | 0.2% |
| TECNICO SUPERIOR | 2799 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| tercer | 6464529 | |
| nivel | 6464529 | |
| superior | 573025 | 4.1% |
| tecnológico | 445771 | 3.2% |
| tecnologico | 108628 | 0.8% |
| técnico | 15827 | 0.1% |
| tecnico | 2799 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 20523810 | |
| R | 14075108 | |
| C | 7610579 | 8.5% |
| I | 7610579 | 8.5% |
| 7499152 | 8.4% | |
| T | 7037554 | 7.9% |
| N | 7037554 | 7.9% |
| L | 7018928 | 7.8% |
| V | 6464529 | 7.2% |
| O | 1809077 | 2.0% |
| Other values (6) | 2735072 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 81922790 | |
| Space Separator | 7499152 | 8.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 20523810 | |
| R | 14075108 | |
| C | 7610579 | 9.3% |
| I | 7610579 | 9.3% |
| T | 7037554 | 8.6% |
| N | 7037554 | 8.6% |
| L | 7018928 | 8.6% |
| V | 6464529 | 7.9% |
| O | 1809077 | 2.2% |
| S | 573025 | 0.7% |
| Other values (5) | 2162047 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| 7499152 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 81922790 | |
| Common | 7499152 | 8.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 20523810 | |
| R | 14075108 | |
| C | 7610579 | 9.3% |
| I | 7610579 | 9.3% |
| T | 7037554 | 8.6% |
| N | 7037554 | 8.6% |
| L | 7018928 | 8.6% |
| V | 6464529 | 7.9% |
| O | 1809077 | 2.2% |
| S | 573025 | 0.7% |
| Other values (5) | 2162047 | 2.6% |
Common
| Value | Count | Frequency (%) |
| 7499152 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 88960344 | |
| None | 461598 | 0.5% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 20523810 | |
| R | 14075108 | |
| C | 7610579 | 8.6% |
| I | 7610579 | 8.6% |
| 7499152 | 8.4% | |
| T | 7037554 | 7.9% |
| N | 7037554 | 7.9% |
| L | 7018928 | 7.9% |
| V | 6464529 | 7.3% |
| O | 1809077 | 2.0% |
| Other values (4) | 2273474 | 2.6% |
None
| Value | Count | Frequency (%) |
| Ó | 445771 | |
| É | 15827 | 3.4% |
AREA_ID
Real number (ℝ)
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.31852 |
| Minimum | 2 |
|---|---|
| Maximum | 26 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 50.2 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 13 |
| median | 14 |
| Q3 | 17 |
| 95-th percentile | 26 |
| Maximum | 26 |
| Range | 24 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 5.2569194 |
|---|---|
| Coefficient of variation (CV) | 0.34317411 |
| Kurtosis | 0.030071772 |
| Mean | 15.31852 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.94709887 |
| Sum | 1.0046489 × 108 |
| Variance | 27.635201 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 13 | 1226193 | |
| 16 | 1053636 | |
| 14 | 993428 | |
| 26 | 988036 | |
| 9 | 950116 | |
| 18 | 345050 | 5.2% |
| 17 | 336311 | 5.1% |
| 10 | 332232 | 5.1% |
| 11 | 176959 | 2.7% |
| 12 | 101415 | 1.5% |
| Other values (6) | 55018 | 0.8% |
| Value | Count | Frequency (%) |
| 2 | 84 | < 0.1% |
| 3 | 5020 | 0.1% |
| 4 | 498 | < 0.1% |
| 7 | 5816 | 0.1% |
| 9 | 950116 | |
| 10 | 332232 | 5.1% |
| 11 | 176959 | 2.7% |
| 12 | 101415 | 1.5% |
| 13 | 1226193 | |
| 14 | 993428 |
| Value | Count | Frequency (%) |
| 26 | 988036 | |
| 24 | 43531 | 0.7% |
| 22 | 69 | < 0.1% |
| 18 | 345050 | 5.2% |
| 17 | 336311 | 5.1% |
| 16 | 1053636 | |
| 14 | 993428 | |
| 13 | 1226193 | |
| 12 | 101415 | 1.5% |
| 11 | 176959 | 2.7% |
AREA_NOMBRE
Categorical
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Memory size | 50.2 MiB |
| CIENCIAS SOCIALES, PERIODISMO, INFORMACION Y DERECHO | |
|---|---|
| SALUD Y BIENESTAR | |
| EDUCACION | |
| INGENIERIA, INDUSTRIA Y CONSTRUCCION | |
| ADMINISTRACION | |
| Other values (11) |
Length
| Max length | 53 |
|---|---|
| Median length | 48 |
| Mean length | 28.416752 |
| Min length | 8 |
Characters and Unicode
| Total characters | 186368255 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ADMINISTRACION |
|---|---|
| 2nd row | CIENCIAS SOCIALES, PERIODISMO, INFORMACION Y DERECHO |
| 3rd row | CIENCIAS SOCIALES, PERIODISMO, INFORMACION Y DERECHO |
| 4th row | CIENCIAS SOCIALES, PERIODISMO, INFORMACION Y DERECHO |
| 5th row | EDUCACION |
Common Values
| Value | Count | Frequency (%) |
| CIENCIAS SOCIALES, PERIODISMO, INFORMACION Y DERECHO | 1226193 | |
| SALUD Y BIENESTAR | 1053636 | |
| EDUCACION | 993428 | |
| INGENIERIA, INDUSTRIA Y CONSTRUCCION | 988036 | |
| ADMINISTRACION | 950116 | |
| TECNOLOGIAS DE LA INFORMACION Y LA COMUNICACION (TIC) | 345050 | 5.2% |
| SERVICIOS | 336311 | 5.1% |
| AGRICULTURA, SILVICULTURA, PESCA Y VETERINARIA | 332232 | 5.1% |
| ARTES Y HUMANIDADES | 176959 | 2.7% |
| CIENCIAS NATURALES, MATEMATICAS Y ESTADISTICA | 101415 | 1.5% |
| Other values (6) | 55018 | 0.8% |
Length
| Value | Count | Frequency (%) |
| y | 4277972 | |
| informacion | 1571243 | 7.0% |
| ciencias | 1376726 | 6.1% |
| sociales | 1237098 | 5.5% |
| derecho | 1231213 | 5.5% |
| periodismo | 1226262 | 5.5% |
| salud | 1059452 | 4.7% |
| bienestar | 1053636 | 4.7% |
| educacion | 998448 | 4.4% |
| ingenieria | 988036 | 4.4% |
| Other values (23) | 7461625 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 24715330 | |
| C | 16068200 | |
| A | 16053808 | |
| 15923317 | ||
| N | 15101217 | 8.1% |
| E | 14207240 | 7.6% |
| O | 13715373 | 7.4% |
| S | 12744098 | 6.8% |
| R | 11326943 | 6.1% |
| D | 7297609 | 3.9% |
| Other values (17) | 39215120 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 165499917 | |
| Space Separator | 15923317 | 8.5% |
| Other Punctuation | 4254921 | 2.3% |
| Open Punctuation | 345050 | 0.2% |
| Close Punctuation | 345050 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 24715330 | |
| C | 16068200 | |
| A | 16053808 | |
| N | 15101217 | |
| E | 14207240 | |
| O | 13715373 | |
| S | 12744098 | |
| R | 11326943 | |
| D | 7297609 | 4.4% |
| T | 6568393 | 4.0% |
| Other values (13) | 27701706 |
Space Separator
| Value | Count | Frequency (%) |
| 15923317 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4254921 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 345050 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 345050 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 165499917 | |
| Common | 20868338 | 11.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 24715330 | |
| C | 16068200 | |
| A | 16053808 | |
| N | 15101217 | |
| E | 14207240 | |
| O | 13715373 | |
| S | 12744098 | |
| R | 11326943 | |
| D | 7297609 | 4.4% |
| T | 6568393 | 4.0% |
| Other values (13) | 27701706 |
Common
| Value | Count | Frequency (%) |
| 15923317 | ||
| , | 4254921 | 20.4% |
| ( | 345050 | 1.7% |
| ) | 345050 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 186281124 | |
| None | 87131 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 24715330 | |
| C | 16068200 | |
| A | 16053808 | |
| 15923317 | ||
| N | 15101217 | 8.1% |
| E | 14207240 | 7.6% |
| O | 13715373 | 7.4% |
| S | 12744098 | 6.8% |
| R | 11326943 | 6.1% |
| D | 7297609 | 3.9% |
| Other values (14) | 39127989 |
None
| Value | Count | Frequency (%) |
| Á | 43531 | |
| Í | 43531 | |
| Ó | 69 | 0.1% |
SUBAREA_ID
Real number (ℝ)
| Distinct | 35 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 411.86731 |
| Minimum | 5 |
|---|---|
| Maximum | 519 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 50.2 MiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 500 |
| median | 509 |
| Q3 | 512 |
| 95-th percentile | 517 |
| Maximum | 519 |
| Range | 514 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 187.5657 |
|---|---|
| Coefficient of variation (CV) | 0.45540322 |
| Kurtosis | 0.04541427 |
| Mean | 411.86731 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -1.4217445 |
| Sum | 2.7011881 × 109 |
| Variance | 35180.891 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 512 | 1030591 | |
| 511 | 993428 | |
| 500 | 950116 | |
| 509 | 727852 | |
| 63 | 508168 | |
| 517 | 345050 | 5.2% |
| 516 | 307699 | 4.7% |
| 510 | 300523 | 4.6% |
| 64 | 253868 | 3.9% |
| 65 | 226000 | 3.4% |
| Other values (25) | 915099 |
| Value | Count | Frequency (%) |
| 5 | 123689 | |
| 15 | 197818 | |
| 20 | 23045 | 0.4% |
| 28 | 84 | < 0.1% |
| 30 | 274 | < 0.1% |
| 32 | 1937 | < 0.1% |
| 33 | 2809 | < 0.1% |
| 37 | 498 | < 0.1% |
| 43 | 5661 | 0.1% |
| 44 | 155 | < 0.1% |
| Value | Count | Frequency (%) |
| 519 | 49 | < 0.1% |
| 517 | 345050 | 5.2% |
| 516 | 307699 | 4.7% |
| 515 | 18538 | 0.3% |
| 514 | 427 | < 0.1% |
| 513 | 9647 | 0.1% |
| 512 | 1030591 | |
| 511 | 993428 | |
| 510 | 300523 | 4.6% |
| 509 | 727852 |
SUBAREA_NOMBRE
Categorical
| Distinct | 30 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Memory size | 50.2 MiB |
| SALUD | |
|---|---|
| EDUCACION | |
| EDUCACION COMERCIAL Y ADMINISTRACION | |
| CIENCIAS SOCIALES Y DEL COMPORTAMIENTO | |
| INGENIERIA Y PROFESIONES AFINES | |
| Other values (25) |
Length
| Max length | 53 |
|---|---|
| Median length | 31 |
| Mean length | 21.88525 |
| Min length | 5 |
Characters and Unicode
| Total characters | 143532095 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EDUCACION COMERCIAL Y ADMINISTRACION |
|---|---|
| 2nd row | PERIODISMO E INFORMACION |
| 3rd row | DERECHO |
| 4th row | PERIODISMO E INFORMACION |
| 5th row | EDUCACION |
Common Values
| Value | Count | Frequency (%) |
| SALUD | 1030591 | |
| EDUCACION | 993428 | |
| EDUCACION COMERCIAL Y ADMINISTRACION | 952053 | |
| CIENCIAS SOCIALES Y DEL COMPORTAMIENTO | 728126 | |
| INGENIERIA Y PROFESIONES AFINES | 508168 | |
| TECNOLOGIAS DE LA INFORMACION Y LA COMUNICACION (TIC) | 345050 | 5.2% |
| SERVICIOS PERSONALES | 307699 | 4.7% |
| DERECHO | 303381 | 4.6% |
| INDUSTRIA Y PRODUCCION | 253868 | 3.9% |
| ARQUITECTURA Y CONSTRUCCION | 226000 | 3.4% |
| Other values (20) | 910030 |
Length
| Value | Count | Frequency (%) |
| y | 3108451 | |
| educacion | 1945481 | 10.6% |
| salud | 1030591 | 5.6% |
| comercial | 952053 | 5.2% |
| administracion | 952053 | 5.2% |
| ciencias | 829541 | 4.5% |
| sociales | 728281 | 4.0% |
| del | 728126 | 4.0% |
| comportamiento | 728126 | 4.0% |
| la | 690100 | 3.8% |
| Other values (37) | 6602108 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 17050055 | |
| C | 14573358 | |
| A | 13252455 | |
| O | 12736468 | |
| E | 12040850 | |
| 11736517 | ||
| N | 11093389 | 7.7% |
| S | 8643012 | 6.0% |
| R | 7385718 | 5.1% |
| D | 6090923 | 4.2% |
| Other values (15) | 28929350 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 131105478 | |
| Space Separator | 11736517 | 8.2% |
| Open Punctuation | 345050 | 0.2% |
| Close Punctuation | 345050 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 17050055 | |
| C | 14573358 | |
| A | 13252455 | |
| O | 12736468 | |
| E | 12040850 | |
| N | 11093389 | |
| S | 8643012 | 6.6% |
| R | 7385718 | 5.6% |
| D | 6090923 | 4.6% |
| L | 5061981 | 3.9% |
| Other values (12) | 23177269 |
Space Separator
| Value | Count | Frequency (%) |
| 11736517 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 345050 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 345050 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 131105478 | |
| Common | 12426617 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 17050055 | |
| C | 14573358 | |
| A | 13252455 | |
| O | 12736468 | |
| E | 12040850 | |
| N | 11093389 | |
| S | 8643012 | 6.6% |
| R | 7385718 | 5.6% |
| D | 6090923 | 4.6% |
| L | 5061981 | 3.9% |
| Other values (12) | 23177269 |
Common
| Value | Count | Frequency (%) |
| 11736517 | ||
| ( | 345050 | 2.8% |
| ) | 345050 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 143532075 | |
| None | 20 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 17050055 | |
| C | 14573358 | |
| A | 13252455 | |
| O | 12736468 | |
| E | 12040850 | |
| 11736517 | ||
| N | 11093389 | 7.7% |
| S | 8643012 | 6.0% |
| R | 7385718 | 5.1% |
| D | 6090923 | 4.2% |
| Other values (14) | 28929330 |
None
| Value | Count | Frequency (%) |
| Ó | 20 |
PROVINCIA
Categorical
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 50.2 MiB |
| GUAYAS | |
|---|---|
| PICHINCHA | |
| MANABI | |
| LOS RIOS | |
| LOJA | |
| Other values (19) |
Length
| Max length | 30 |
|---|---|
| Median length | 16 |
| Mean length | 7.3250078 |
| Min length | 4 |
Characters and Unicode
| Total characters | 48168929 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PICHINCHA |
|---|---|
| 2nd row | GUAYAS |
| 3rd row | LOJA |
| 4th row | LOJA |
| 5th row | LOJA |
Common Values
| Value | Count | Frequency (%) |
| GUAYAS | 2160962 | |
| PICHINCHA | 1430371 | |
| MANABI | 750561 | 11.4% |
| LOS RIOS | 282221 | 4.3% |
| LOJA | 268672 | 4.1% |
| AZUAY | 267075 | 4.1% |
| CHIMBORAZO | 251570 | 3.8% |
| IMBABURA | 233982 | 3.6% |
| EL ORO | 215532 | 3.3% |
| TUNGURAHUA | 210694 | 3.2% |
| Other values (14) | 504316 | 7.7% |
Length
| Value | Count | Frequency (%) |
| guayas | 2160962 | |
| pichincha | 1430371 | |
| manabi | 750561 | 10.2% |
| los | 327248 | 4.4% |
| rios | 282221 | 3.8% |
| loja | 268672 | 3.6% |
| azuay | 267075 | 3.6% |
| chimborazo | 251570 | 3.4% |
| imbabura | 233982 | 3.2% |
| el | 215532 | 2.9% |
| Other values (22) | 1191407 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 10163079 | |
| I | 4690319 | |
| H | 3396449 | 7.1% |
| C | 3363484 | 7.0% |
| U | 3305229 | 6.9% |
| S | 3198663 | 6.6% |
| N | 2746745 | 5.7% |
| Y | 2428037 | 5.0% |
| G | 2422501 | 5.0% |
| O | 2299047 | 4.8% |
| Other values (14) | 10155376 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 47365284 | |
| Space Separator | 803645 | 1.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 10163079 | |
| I | 4690319 | |
| H | 3396449 | 7.2% |
| C | 3363484 | 7.1% |
| U | 3305229 | 7.0% |
| S | 3198663 | 6.8% |
| N | 2746745 | 5.8% |
| Y | 2428037 | 5.1% |
| G | 2422501 | 5.1% |
| O | 2299047 | 4.9% |
| Other values (13) | 9351731 |
Space Separator
| Value | Count | Frequency (%) |
| 803645 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 47365284 | |
| Common | 803645 | 1.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 10163079 | |
| I | 4690319 | |
| H | 3396449 | 7.2% |
| C | 3363484 | 7.1% |
| U | 3305229 | 7.0% |
| S | 3198663 | 6.8% |
| N | 2746745 | 5.8% |
| Y | 2428037 | 5.1% |
| G | 2422501 | 5.1% |
| O | 2299047 | 4.9% |
| Other values (13) | 9351731 |
Common
| Value | Count | Frequency (%) |
| 803645 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48148768 | |
| None | 20161 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 10163079 | |
| I | 4690319 | |
| H | 3396449 | 7.1% |
| C | 3363484 | 7.0% |
| U | 3305229 | 6.9% |
| S | 3198663 | 6.6% |
| N | 2746745 | 5.7% |
| Y | 2428037 | 5.0% |
| G | 2422501 | 5.0% |
| O | 2299047 | 4.8% |
| Other values (13) | 10135215 |
None
| Value | Count | Frequency (%) |
| Ñ | 20161 |
CANTON
Categorical
HIGH CARDINALITY  HIGH CORRELATION 
| Distinct | 78 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 50.2 MiB |
| GUAYAQUIL | |
|---|---|
| DISTRITO METROPOLITANO DE QUITO | |
| MILAGRO | |
| PORTOVIEJO | |
| RUMIÑAHUI | |
| Other values (73) |
Length
| Max length | 31 |
|---|---|
| Median length | 19 |
| Mean length | 11.933109 |
| Min length | 4 |
Characters and Unicode
| Total characters | 78471600 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | DISTRITO METROPOLITANO DE QUITO |
|---|---|
| 2nd row | MILAGRO |
| 3rd row | LOJA |
| 4th row | LOJA |
| 5th row | LOJA |
Common Values
| Value | Count | Frequency (%) |
| GUAYAQUIL | 1534634 | |
| DISTRITO METROPOLITANO DE QUITO | 1148237 | |
| MILAGRO | 595648 | 9.1% |
| PORTOVIEJO | 413746 | 6.3% |
| RUMIÑAHUI | 275964 | 4.2% |
| LOJA | 267695 | 4.1% |
| CUENCA | 266970 | 4.1% |
| RIOBAMBA | 250415 | 3.8% |
| MACHALA | 209321 | 3.2% |
| IBARRA | 209249 | 3.2% |
| Other values (68) | 1404077 |
Length
| Value | Count | Frequency (%) |
| guayaquil | 1534634 | |
| de | 1168030 | |
| quito | 1148257 | |
| distrito | 1148237 | |
| metropolitano | 1148237 | |
| milagro | 595648 | 5.8% |
| portoviejo | 413746 | 4.0% |
| rumiñahui | 275964 | 2.7% |
| loja | 267695 | 2.6% |
| cuenca | 266970 | 2.6% |
| Other values (82) | 2317804 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 9717085 | |
| O | 8965079 | |
| I | 8525273 | |
| T | 6900772 | 8.8% |
| U | 5488676 | 7.0% |
| R | 4548275 | 5.8% |
| L | 4294863 | 5.5% |
| 3709266 | 4.7% | |
| E | 3682344 | 4.7% |
| M | 3057108 | 3.9% |
| Other values (15) | 19582859 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 74762334 | |
| Space Separator | 3709266 | 4.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 9717085 | |
| O | 8965079 | |
| I | 8525273 | |
| T | 6900772 | |
| U | 5488676 | 7.3% |
| R | 4548275 | 6.1% |
| L | 4294863 | 5.7% |
| E | 3682344 | 4.9% |
| M | 3057108 | 4.1% |
| Q | 2826440 | 3.8% |
| Other values (14) | 16756419 |
Space Separator
| Value | Count | Frequency (%) |
| 3709266 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 74762334 | |
| Common | 3709266 | 4.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 9717085 | |
| O | 8965079 | |
| I | 8525273 | |
| T | 6900772 | |
| U | 5488676 | 7.3% |
| R | 4548275 | 6.1% |
| L | 4294863 | 5.7% |
| E | 3682344 | 4.9% |
| M | 3057108 | 4.1% |
| Q | 2826440 | 3.8% |
| Other values (14) | 16756419 |
Common
| Value | Count | Frequency (%) |
| 3709266 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 78193109 | |
| None | 278491 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 9717085 | |
| O | 8965079 | |
| I | 8525273 | |
| T | 6900772 | 8.8% |
| U | 5488676 | 7.0% |
| R | 4548275 | 5.8% |
| L | 4294863 | 5.5% |
| 3709266 | 4.7% | |
| E | 3682344 | 4.7% |
| M | 3057108 | 3.9% |
| Other values (14) | 19304368 |
None
| Value | Count | Frequency (%) |
| Ñ | 278491 |
PARROQUIA
Categorical
| Distinct | 113 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 50.2 MiB |
| GUAYAQUIL, CABECERA CANTONAL Y CAPITAL PROVINCIAL | |
|---|---|
| QUITO DISTRITO METROPOLITANO, CABECERA CANTONAL, CAPITAL PROVINCIAL Y DE LA REPUBLICA DEL ECUADOR | |
| MILAGRO, CABECERA CANTONAL | |
| PORTOVIEJO | |
| SANGOLQUÍ | |
| Other values (108) |
Length
| Max length | 97 |
|---|---|
| Median length | 49 |
| Mean length | 42.106263 |
| Min length | 4 |
Characters and Unicode
| Total characters | 276888931 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 3 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | QUITO DISTRITO METROPOLITANO, CABECERA CANTONAL, CAPITAL PROVINCIAL Y DE LA REPUBLICA DEL ECUADOR |
|---|---|
| 2nd row | MILAGRO, CABECERA CANTONAL |
| 3rd row | LOJA, CABECERA CANTONAL Y CAPITAL PROVINCIAL |
| 4th row | LOJA, CABECERA CANTONAL Y CAPITAL PROVINCIAL |
| 5th row | LOJA, CABECERA CANTONAL Y CAPITAL PROVINCIAL |
Common Values
| Value | Count | Frequency (%) |
| GUAYAQUIL, CABECERA CANTONAL Y CAPITAL PROVINCIAL | 1534108 | |
| QUITO DISTRITO METROPOLITANO, CABECERA CANTONAL, CAPITAL PROVINCIAL Y DE LA REPUBLICA DEL ECUADOR | 1034223 | |
| MILAGRO, CABECERA CANTONAL | 595648 | 9.1% |
| PORTOVIEJO | 413746 | 6.3% |
| SANGOLQUÍ | 275437 | 4.2% |
| LOJA, CABECERA CANTONAL Y CAPITAL PROVINCIAL | 267475 | 4.1% |
| CUENCA, CABECERA CANTONAL Y CAPITAL PROVINCIAL. | 260396 | 4.0% |
| RIOBAMBA, CABECERA CANTONAL Y CAPITAL PROVINCIAL | 250264 | 3.8% |
| MACHALA | 209247 | 3.2% |
| SAN MIGUEL DE IBARRA, CABECERA CANTONAL Y CAPITAL PROVINCIAL | 209011 | 3.2% |
| Other values (103) | 1526401 |
Length
| Value | Count | Frequency (%) |
| cabecera | 4634515 | |
| cantonal | 4634515 | |
| provincial | 3886686 | |
| y | 3886686 | |
| capital | 3886686 | |
| guayaquil | 1534108 | 4.3% |
| la | 1331267 | 3.7% |
| de | 1296446 | 3.6% |
| quito | 1034243 | 2.9% |
| distrito | 1034223 | 2.9% |
| Other values (134) | 8777337 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 43067194 | |
| 29360756 | ||
| C | 25021728 | |
| I | 20614998 | 7.4% |
| L | 20474575 | 7.4% |
| O | 18300755 | 6.6% |
| E | 16691669 | 6.0% |
| N | 15578883 | 5.6% |
| R | 14851407 | 5.4% |
| T | 14771482 | 5.3% |
| Other values (24) | 58155484 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 241581223 | |
| Space Separator | 29360756 | 10.6% |
| Other Punctuation | 5929134 | 2.1% |
| Open Punctuation | 8842 | < 0.1% |
| Close Punctuation | 8842 | < 0.1% |
| Dash Punctuation | 134 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 43067194 | |
| C | 25021728 | |
| I | 20614998 | |
| L | 20474575 | |
| O | 18300755 | |
| E | 16691669 | 6.9% |
| N | 15578883 | 6.4% |
| R | 14851407 | 6.1% |
| T | 14771482 | 6.1% |
| P | 10441957 | 4.3% |
| Other values (18) | 41766575 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 5668738 | |
| . | 260396 | 4.4% |
Space Separator
| Value | Count | Frequency (%) |
| 29360756 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 8842 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 8842 |
Dash Punctuation
| Value | Count | Frequency (%) |
| – | 134 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 241581223 | |
| Common | 35307708 | 12.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 43067194 | |
| C | 25021728 | |
| I | 20614998 | |
| L | 20474575 | |
| O | 18300755 | |
| E | 16691669 | 6.9% |
| N | 15578883 | 6.4% |
| R | 14851407 | 6.1% |
| T | 14771482 | 6.1% |
| P | 10441957 | 4.3% |
| Other values (18) | 41766575 |
Common
| Value | Count | Frequency (%) |
| 29360756 | ||
| , | 5668738 | 16.1% |
| . | 260396 | 0.7% |
| ( | 8842 | < 0.1% |
| ) | 8842 | < 0.1% |
| – | 134 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 276464802 | |
| None | 423995 | 0.2% |
| Punctuation | 134 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 43067194 | |
| 29360756 | ||
| C | 25021728 | |
| I | 20614998 | 7.5% |
| L | 20474575 | 7.4% |
| O | 18300755 | 6.6% |
| E | 16691669 | 6.0% |
| N | 15578883 | 5.6% |
| R | 14851407 | 5.4% |
| T | 14771482 | 5.3% |
| Other values (18) | 57731355 |
None
| Value | Count | Frequency (%) |
| Í | 285093 | |
| Ñ | 103379 | 24.4% |
| Á | 32898 | 7.8% |
| Ó | 1478 | 0.3% |
| É | 1147 | 0.3% |
Punctuation
| Value | Count | Frequency (%) |
| – | 134 |
CAM_NOMBRE_CAMPUS
Categorical
| Distinct | 175 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Memory size | 50.2 MiB |
| MATRIZ - GUAYAQUIL | |
|---|---|
| MATRIZ - QUITO | |
| MATRIZ - MILAGRO | |
| MATRIZ - PORTOVIEJO | |
| MATRIZ - CAMPUS CENTRAL | 278644 |
| Other values (170) |
Length
| Max length | 71 |
|---|---|
| Median length | 42 |
| Mean length | 16.526857 |
| Min length | 4 |
Characters and Unicode
| Total characters | 108389643 |
|---|---|
| Distinct characters | 41 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | MATRIZ - QUITO |
|---|---|
| 2nd row | MATRIZ - MILAGRO |
| 3rd row | MATRIZ - LOJA |
| 4th row | MATRIZ - LOJA |
| 5th row | MATRIZ - LOJA |
Common Values
| Value | Count | Frequency (%) |
| MATRIZ - GUAYAQUIL | 1472238 | |
| MATRIZ - QUITO | 962724 | |
| MATRIZ - MILAGRO | 576897 | 8.8% |
| MATRIZ - PORTOVIEJO | 412884 | 6.3% |
| MATRIZ - CAMPUS CENTRAL | 278644 | 4.2% |
| MATRIZ - LOJA | 259307 | 3.9% |
| MATRIZ - AZUAY. | 240948 | 3.7% |
| MATRIZ - RIOBAMBA | 227095 | 3.5% |
| MATRIZ - MANTA | 205128 | 3.1% |
| MATRIZ - IBARRA | 204292 | 3.1% |
| Other values (165) | 1718237 |
Length
| Value | Count | Frequency (%) |
| 6182908 | ||
| matriz | 6131638 | |
| guayaquil | 1533519 | 7.7% |
| quito | 1017418 | 5.1% |
| milagro | 592853 | 3.0% |
| portoviejo | 413654 | 2.1% |
| campus | 278644 | 1.4% |
| central | 278644 | 1.4% |
| loja | 267750 | 1.3% |
| riobamba | 248834 | 1.3% |
| Other values (121) | 2913162 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 15398605 | |
| 13308845 | ||
| I | 10663945 | |
| T | 8709329 | |
| R | 8584857 | |
| M | 8029823 | 7.4% |
| Z | 6411592 | 5.9% |
| - | 6196591 | 5.7% |
| U | 5156211 | 4.8% |
| O | 4638772 | 4.3% |
| Other values (31) | 21291073 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 88621088 | |
| Space Separator | 13308845 | 12.3% |
| Dash Punctuation | 6196591 | 5.7% |
| Other Punctuation | 248155 | 0.2% |
| Lowercase Letter | 8030 | < 0.1% |
| Close Punctuation | 3467 | < 0.1% |
| Open Punctuation | 3467 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 15398605 | |
| I | 10663945 | |
| T | 8709329 | |
| R | 8584857 | |
| M | 8029823 | |
| Z | 6411592 | |
| U | 5156211 | 5.8% |
| O | 4638772 | 5.2% |
| L | 3742631 | 4.2% |
| Q | 2707045 | 3.1% |
| Other values (17) | 14578278 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5180 | |
| d | 2650 | |
| a | 60 | 0.7% |
| i | 40 | 0.5% |
| c | 40 | 0.5% |
| n | 20 | 0.2% |
| é | 20 | 0.2% |
| m | 20 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 244688 | |
| , | 3467 | 1.4% |
Space Separator
| Value | Count | Frequency (%) |
| 13308845 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6196591 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3467 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3467 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 88629118 | |
| Common | 19760525 | 18.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 15398605 | |
| I | 10663945 | |
| T | 8709329 | |
| R | 8584857 | |
| M | 8029823 | |
| Z | 6411592 | |
| U | 5156211 | 5.8% |
| O | 4638772 | 5.2% |
| L | 3742631 | 4.2% |
| Q | 2707045 | 3.1% |
| Other values (25) | 14586308 |
Common
| Value | Count | Frequency (%) |
| 13308845 | ||
| - | 6196591 | |
| . | 244688 | 1.2% |
| ) | 3467 | < 0.1% |
| , | 3467 | < 0.1% |
| ( | 3467 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 108373646 | |
| None | 15997 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 15398605 | |
| 13308845 | ||
| I | 10663945 | |
| T | 8709329 | |
| R | 8584857 | |
| M | 8029823 | 7.4% |
| Z | 6411592 | 5.9% |
| - | 6196591 | 5.7% |
| U | 5156211 | 4.8% |
| O | 4638772 | 4.3% |
| Other values (27) | 21275076 |
None
| Value | Count | Frequency (%) |
| Ó | 9970 | |
| Ñ | 3156 | 19.7% |
| Í | 2851 | 17.8% |
| é | 20 | 0.1% |
PRD_ID_SEGMENTO
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17562 |
| Missing (%) | 0.3% |
| Memory size | 50.2 MiB |
| 800.0 | |
|---|---|
| 676.0 | 80677 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 32791970 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 800.0 |
|---|---|
| 2nd row | 800.0 |
| 3rd row | 800.0 |
| 4th row | 800.0 |
| 5th row | 800.0 |
Common Values
| Value | Count | Frequency (%) |
| 800.0 | 6477717 | |
| 676.0 | 80677 | 1.2% |
| (Missing) | 17562 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 800.0 | 6477717 | |
| 676.0 | 80677 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 19513828 | |
| . | 6558394 | 20.0% |
| 8 | 6477717 | 19.8% |
| 6 | 161354 | 0.5% |
| 7 | 80677 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 26233576 | |
| Other Punctuation | 6558394 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 19513828 | |
| 8 | 6477717 | 24.7% |
| 6 | 161354 | 0.6% |
| 7 | 80677 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6558394 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 32791970 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 19513828 | |
| . | 6558394 | 20.0% |
| 8 | 6477717 | 19.8% |
| 6 | 161354 | 0.5% |
| 7 | 80677 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32791970 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 19513828 | |
| . | 6558394 | 20.0% |
| 8 | 6477717 | 19.8% |
| 6 | 161354 | 0.5% |
| 7 | 80677 | 0.2% |
SEGMETO_CARRERA
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 50.2 MiB |
| OFERTA PÚBLICA | |
|---|---|
| POLITICA DE ACCION AFIRMATIVA | 80677 |
| POBLACION GENERAL | 5457 |
Length
| Max length | 29 |
|---|---|
| Median length | 14 |
| Mean length | 14.186517 |
| Min length | 14 |
Characters and Unicode
| Total characters | 93289910 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | OFERTA PÚBLICA |
|---|---|
| 2nd row | OFERTA PÚBLICA |
| 3rd row | OFERTA PÚBLICA |
| 4th row | OFERTA PÚBLICA |
| 5th row | OFERTA PÚBLICA |
Common Values
| Value | Count | Frequency (%) |
| OFERTA PÚBLICA | 6489822 | |
| POLITICA DE ACCION AFIRMATIVA | 80677 | 1.2% |
| POBLACION GENERAL | 5457 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| oferta | 6489822 | |
| pública | 6489822 | |
| politica | 80677 | 0.6% |
| de | 80677 | 0.6% |
| accion | 80677 | 0.6% |
| afirmativa | 80677 | 0.6% |
| poblacion | 5457 | < 0.1% |
| general | 5457 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 13393943 | |
| I | 6898664 | 7.4% |
| 6737310 | 7.2% | |
| C | 6737310 | 7.2% |
| O | 6662090 | 7.1% |
| T | 6651176 | 7.1% |
| E | 6581413 | 7.1% |
| L | 6581413 | 7.1% |
| R | 6575956 | 7.0% |
| P | 6575956 | 7.0% |
| Other values (8) | 19894679 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 86552600 | |
| Space Separator | 6737310 | 7.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 13393943 | |
| I | 6898664 | |
| C | 6737310 | |
| O | 6662090 | |
| T | 6651176 | |
| E | 6581413 | |
| L | 6581413 | |
| R | 6575956 | |
| P | 6575956 | |
| F | 6570499 | |
| Other values (7) | 13324180 |
Space Separator
| Value | Count | Frequency (%) |
| 6737310 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 86552600 | |
| Common | 6737310 | 7.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 13393943 | |
| I | 6898664 | |
| C | 6737310 | |
| O | 6662090 | |
| T | 6651176 | |
| E | 6581413 | |
| L | 6581413 | |
| R | 6575956 | |
| P | 6575956 | |
| F | 6570499 | |
| Other values (7) | 13324180 |
Common
| Value | Count | Frequency (%) |
| 6737310 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 86800088 | |
| None | 6489822 | 7.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 13393943 | |
| I | 6898664 | |
| 6737310 | ||
| C | 6737310 | |
| O | 6662090 | |
| T | 6651176 | |
| E | 6581413 | |
| L | 6581413 | |
| R | 6575956 | |
| P | 6575956 | |
| Other values (7) | 13404857 |
None
| Value | Count | Frequency (%) |
| Ú | 6489822 |
cod_final
Real number (ℝ)
| Distinct | 505868 |
|---|---|
| Distinct (%) | 7.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.5875693 × 109 |
| Minimum | 1.0000009 × 109 |
|---|---|
| Maximum | 9.999701 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 50.2 MiB |
Quantile statistics
| Minimum | 1.0000009 × 109 |
|---|---|
| 5-th percentile | 1.2353309 × 109 |
| Q1 | 1.9671712 × 109 |
| median | 2.3294514 × 109 |
| Q3 | 2.5752612 × 109 |
| 95-th percentile | 6.5997014 × 109 |
| Maximum | 9.999701 × 109 |
| Range | 8.9997001 × 109 |
| Interquartile range (IQR) | 6.0809009 × 108 |
Descriptive statistics
| Standard deviation | 1.5393558 × 109 |
|---|---|
| Coefficient of variation (CV) | 0.5949042 |
| Kurtosis | 9.5742217 |
| Mean | 2.5875693 × 109 |
| Median Absolute Deviation (MAD) | 2.8190041 × 108 |
| Skewness | 3.0699059 |
| Sum | 1.7015742 × 1016 |
| Variance | 2.3696164 × 1018 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2154521774 | 196 | < 0.1% |
| 2151881765 | 184 | < 0.1% |
| 2186391738 | 179 | < 0.1% |
| 2465590901 | 177 | < 0.1% |
| 2170491792 | 175 | < 0.1% |
| 2165561738 | 170 | < 0.1% |
| 2370820965 | 168 | < 0.1% |
| 2146491729 | 160 | < 0.1% |
| 2178341701 | 159 | < 0.1% |
| 2438500956 | 157 | < 0.1% |
| Other values (505858) | 6574231 |
| Value | Count | Frequency (%) |
| 1000000901 | 5 | < 0.1% |
| 1000011238 | 9 | |
| 1000011729 | 5 | < 0.1% |
| 1000030892 | 5 | < 0.1% |
| 1000050374 | 2 | < 0.1% |
| 1000051774 | 4 | < 0.1% |
| 1000070956 | 3 | < 0.1% |
| 1000080147 | 3 | < 0.1% |
| 1000090929 | 18 | |
| 1000100829 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 9999700965 | 15 | |
| 9999630929 | 12 | |
| 9999501765 | 14 | |
| 9999440129 | 9 | |
| 9999392356 | 2 | < 0.1% |
| 9999360901 | 5 | < 0.1% |
| 9999351383 | 5 | < 0.1% |
| 9999311210 | 11 | |
| 9999300983 | 3 | < 0.1% |
| 9999291383 | 4 | < 0.1% |
archivo
Categorical
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 50.2 MiB |
| primera_postulacion_per22.csv | |
|---|---|
| primera_postulacion_per19.csv | |
| primera_postulacion_per21.csv | |
| primera_postulacion_per20.csv | |
| primera_postulacion_per18.csv | |
| Other values (16) |
Length
| Max length | 32 |
|---|---|
| Median length | 29 |
| Mean length | 28.989597 |
| Min length | 28 |
Characters and Unicode
| Total characters | 190634314 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | tercera_postulacion_per22.csv |
|---|---|
| 2nd row | tercera_postulacion_per22.csv |
| 3rd row | tercera_postulacion_per22.csv |
| 4th row | tercera_postulacion_per22.csv |
| 5th row | tercera_postulacion_per22.csv |
Common Values
| Value | Count | Frequency (%) |
| primera_postulacion_per22.csv | 861282 | |
| primera_postulacion_per19.csv | 744253 | |
| primera_postulacion_per21.csv | 706982 | |
| primera_postulacion_per20.csv | 699663 | |
| primera_postulacion_per18.csv | 585182 | |
| segunda_postulacion_per22.csv | 404773 | 6.2% |
| segunda_postulacion_per19.csv | 400010 | 6.1% |
| segunda_postulacion_per21.csv | 366405 | 5.6% |
| segunda_postulacion_per20.csv | 333697 | 5.1% |
| segunda_postulacion_per18.csv | 297845 | 4.5% |
| Other values (11) | 1175864 |
Length
| Value | Count | Frequency (%) |
| primera_postulacion_per22.csv | 861282 | |
| primera_postulacion_per19.csv | 744253 | |
| primera_postulacion_per21.csv | 706982 | |
| primera_postulacion_per20.csv | 699663 | |
| primera_postulacion_per18.csv | 585182 | |
| segunda_postulacion_per22.csv | 404773 | 6.2% |
| segunda_postulacion_per19.csv | 400010 | 6.1% |
| segunda_postulacion_per21.csv | 366405 | 5.6% |
| segunda_postulacion_per20.csv | 333697 | 5.1% |
| segunda_postulacion_per18.csv | 297845 | 4.5% |
| Other values (11) | 1175864 |
Most occurring characters
| Value | Count | Frequency (%) |
| p | 16748589 | 8.8% |
| r | 15999939 | 8.4% |
| s | 14954642 | 7.8% |
| c | 14308841 | 7.5% |
| e | 14052515 | 7.4% |
| a | 13306074 | 7.0% |
| _ | 13169474 | 6.9% |
| o | 13133665 | 6.9% |
| i | 10176061 | 5.3% |
| u | 8532851 | 4.5% |
| Other values (14) | 56251663 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 157736972 | |
| Connector Punctuation | 13169474 | 6.9% |
| Decimal Number | 13151912 | 6.9% |
| Other Punctuation | 6575956 | 3.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| p | 16748589 | |
| r | 15999939 | |
| s | 14954642 | |
| c | 14308841 | |
| e | 14052515 | |
| a | 13306074 | |
| o | 13133665 | |
| i | 10176061 | 6.5% |
| u | 8532851 | 5.4% |
| n | 8363867 | 5.3% |
| Other values (7) | 28159928 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 5750067 | |
| 1 | 3755201 | |
| 9 | 1401656 | 10.7% |
| 0 | 1200308 | 9.1% |
| 8 | 1044680 | 7.9% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 13169474 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6575956 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 157736972 | |
| Common | 32897342 | 17.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| p | 16748589 | |
| r | 15999939 | |
| s | 14954642 | |
| c | 14308841 | |
| e | 14052515 | |
| a | 13306074 | |
| o | 13133665 | |
| i | 10176061 | 6.5% |
| u | 8532851 | 5.4% |
| n | 8363867 | 5.3% |
| Other values (7) | 28159928 |
Common
| Value | Count | Frequency (%) |
| _ | 13169474 | |
| . | 6575956 | |
| 2 | 5750067 | |
| 1 | 3755201 | 11.4% |
| 9 | 1401656 | 4.3% |
| 0 | 1200308 | 3.6% |
| 8 | 1044680 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 190634314 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| p | 16748589 | 8.8% |
| r | 15999939 | 8.4% |
| s | 14954642 | 7.8% |
| c | 14308841 | 7.5% |
| e | 14052515 | 7.4% |
| a | 13306074 | 7.0% |
| _ | 13169474 | 6.9% |
| o | 13133665 | 6.9% |
| i | 10176061 | 5.3% |
| u | 8532851 | 4.5% |
| Other values (14) | 56251663 |
| Unnamed: 0 | INS_ID | INI_ID | CAE_NOTA_POSTULA | POS_ID | CUS_ID | NOTA_POSTULA | PRD_ID_NUM_POSTULACION | IES_ID | OFA_ID | APC_ID | CCP_ID | CAR_ID | MODALIDAD_ID | AREA_ID | SUBAREA_ID | cod_final | PER_ID | INS_POBLACION | INS_TIPO_INSCRIPCION | SEGMENTO_ASPIRANTE | CAE_GRUPO | POS_PRIORIDAD | POS_ESTADO | IES_TIPO_IES | IES_TIPO_FINANCIAMIENTO | IES_ESTADO | MODALIDAD | JORNADA_ID | JORNADA | NIVEL | AREA_NOMBRE | SUBAREA_NOMBRE | PROVINCIA | CANTON | PRD_ID_SEGMENTO | SEGMETO_CARRERA | archivo | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Unnamed: 0 | 1.000 | 0.061 | 0.061 | 0.116 | -0.033 | -0.024 | 0.121 | -0.536 | -0.043 | 0.070 | 0.062 | 0.083 | -0.027 | -0.027 | 0.050 | 0.004 | 0.049 | 0.138 | 0.062 | 0.070 | 0.061 | 0.069 | 0.017 | 0.005 | 0.029 | 0.018 | 0.008 | 0.033 | 0.110 | 0.110 | 0.033 | 0.040 | 0.046 | 0.229 | 0.233 | 0.032 | 0.045 | 0.252 |
| INS_ID | 0.061 | 1.000 | 1.000 | 0.207 | 0.962 | 0.958 | 0.215 | 0.041 | -0.032 | 0.958 | 0.958 | 0.955 | 0.005 | 0.095 | -0.007 | 0.031 | 0.286 | 0.925 | 0.206 | 0.638 | 0.149 | 0.331 | 0.016 | 0.012 | 0.037 | 0.019 | 0.017 | 0.073 | 0.134 | 0.133 | 0.153 | 0.038 | 0.042 | 0.117 | 0.126 | 0.080 | 0.065 | 0.618 |
| INI_ID | 0.061 | 1.000 | 1.000 | 0.208 | 0.962 | 0.958 | 0.215 | 0.040 | -0.032 | 0.958 | 0.958 | 0.955 | 0.005 | 0.095 | -0.007 | 0.031 | 0.287 | 0.924 | 0.198 | 0.570 | 0.164 | 0.335 | 0.016 | 0.012 | 0.035 | 0.023 | 0.016 | 0.072 | 0.135 | 0.135 | 0.153 | 0.038 | 0.042 | 0.120 | 0.129 | 0.083 | 0.066 | 0.617 |
| CAE_NOTA_POSTULA | 0.116 | 0.207 | 0.208 | 1.000 | 0.186 | 0.200 | 0.995 | -0.157 | -0.181 | 0.228 | 0.226 | 0.251 | -0.089 | -0.087 | 0.050 | 0.062 | 0.185 | 0.176 | 0.071 | 0.105 | 0.127 | 0.138 | 0.016 | 0.006 | 0.145 | 0.020 | 0.005 | 0.055 | 0.130 | 0.130 | 0.097 | 0.093 | 0.103 | 0.107 | 0.111 | 0.010 | 0.010 | 0.136 |
| POS_ID | -0.033 | 0.962 | 0.962 | 0.186 | 1.000 | 0.992 | 0.195 | 0.219 | -0.042 | 0.958 | 0.958 | 0.957 | 0.015 | 0.101 | -0.013 | 0.027 | 0.305 | 0.873 | 0.186 | 0.093 | 0.140 | 0.316 | 0.025 | 0.016 | 0.028 | 0.032 | 0.020 | 0.073 | 0.110 | 0.109 | 0.144 | 0.041 | 0.045 | 0.086 | 0.100 | 0.079 | 0.077 | 0.800 |
| CUS_ID | -0.024 | 0.958 | 0.958 | 0.200 | 0.992 | 1.000 | 0.208 | 0.221 | -0.062 | 0.958 | 0.958 | 0.957 | 0.010 | 0.103 | -0.013 | 0.032 | 0.310 | 0.928 | 0.186 | 0.103 | 0.140 | 0.347 | 0.025 | 0.016 | 0.112 | 0.047 | 0.018 | 0.081 | 0.122 | 0.122 | 0.171 | 0.050 | 0.056 | 0.130 | 0.145 | 0.107 | 0.107 | 0.893 |
| NOTA_POSTULA | 0.121 | 0.215 | 0.215 | 0.995 | 0.195 | 0.208 | 1.000 | -0.154 | -0.174 | 0.236 | 0.234 | 0.257 | -0.089 | -0.085 | 0.049 | 0.062 | 0.188 | 0.181 | 0.069 | 0.102 | 0.125 | 0.140 | 0.016 | 0.006 | 0.143 | 0.020 | 0.005 | 0.055 | 0.126 | 0.126 | 0.095 | 0.093 | 0.102 | 0.105 | 0.110 | 0.010 | 0.010 | 0.137 |
| PRD_ID_NUM_POSTULACION | -0.536 | 0.041 | 0.040 | -0.157 | 0.219 | 0.221 | -0.154 | 1.000 | 0.006 | 0.038 | 0.040 | 0.033 | 0.060 | 0.079 | -0.053 | -0.025 | -0.019 | 0.123 | 0.057 | 0.007 | 0.067 | 0.086 | 0.067 | 0.000 | 0.005 | 0.022 | 0.001 | 0.043 | 0.071 | 0.043 | 0.037 | 0.052 | 0.057 | 0.064 | 0.087 | 0.039 | 0.452 | 1.000 |
| IES_ID | -0.043 | -0.032 | -0.032 | -0.181 | -0.042 | -0.062 | -0.174 | 0.006 | 1.000 | -0.044 | -0.036 | -0.080 | 0.156 | 0.109 | -0.049 | -0.028 | -0.038 | 0.022 | 0.022 | 0.025 | 0.021 | 0.028 | 0.014 | 0.002 | 0.948 | 0.471 | 0.019 | 0.169 | 0.169 | 0.170 | 0.466 | 0.167 | 0.235 | 0.302 | 0.498 | 0.347 | 0.242 | 0.055 |
| OFA_ID | 0.070 | 0.958 | 0.958 | 0.228 | 0.958 | 0.958 | 0.236 | 0.038 | -0.044 | 1.000 | 0.958 | 0.958 | -0.005 | 0.098 | -0.002 | 0.036 | 0.314 | 0.844 | 0.174 | 0.103 | 0.134 | 0.423 | 0.017 | 0.012 | 0.057 | 0.024 | 0.015 | 0.073 | 0.107 | 0.106 | 0.166 | 0.058 | 0.077 | 0.113 | 0.135 | 0.086 | 0.067 | 0.689 |
| APC_ID | 0.062 | 0.958 | 0.958 | 0.226 | 0.958 | 0.958 | 0.234 | 0.040 | -0.036 | 0.958 | 1.000 | 0.957 | 0.012 | 0.094 | 0.003 | 0.031 | 0.314 | 0.901 | 0.177 | 0.104 | 0.133 | 0.392 | 0.016 | 0.012 | 0.057 | 0.021 | 0.019 | 0.073 | 0.111 | 0.111 | 0.163 | 0.070 | 0.086 | 0.116 | 0.133 | 0.078 | 0.078 | 0.682 |
| CCP_ID | 0.083 | 0.955 | 0.955 | 0.251 | 0.957 | 0.957 | 0.257 | 0.033 | -0.080 | 0.958 | 0.957 | 1.000 | -0.005 | 0.070 | 0.003 | 0.043 | 0.323 | 0.954 | 0.216 | 0.135 | 0.146 | 0.348 | 0.016 | 0.012 | 0.170 | 0.052 | 0.025 | 0.131 | 0.232 | 0.231 | 0.207 | 0.067 | 0.078 | 0.271 | 0.339 | 0.087 | 0.071 | 0.637 |
| CAR_ID | -0.027 | 0.005 | 0.005 | -0.089 | 0.015 | 0.010 | -0.089 | 0.060 | 0.156 | -0.005 | 0.012 | -0.005 | 1.000 | -0.016 | -0.027 | -0.004 | -0.011 | 0.027 | 0.022 | 0.024 | 0.021 | 0.024 | 0.021 | 0.002 | 0.530 | 0.104 | 0.043 | 0.114 | 0.141 | 0.140 | 0.298 | 0.369 | 0.519 | 0.135 | 0.206 | 0.084 | 0.060 | 0.050 |
| MODALIDAD_ID | -0.027 | 0.095 | 0.095 | -0.087 | 0.101 | 0.103 | -0.085 | 0.079 | 0.109 | 0.098 | 0.094 | 0.070 | -0.016 | 1.000 | 0.009 | 0.106 | 0.016 | 0.085 | 0.090 | 0.040 | 0.025 | 0.065 | 0.021 | 0.001 | 0.017 | 0.057 | 0.003 | 1.000 | 0.577 | 0.577 | 0.020 | 0.284 | 0.314 | 0.217 | 0.463 | 0.055 | 0.055 | 0.116 |
| AREA_ID | 0.050 | -0.007 | -0.007 | 0.050 | -0.013 | -0.013 | 0.049 | -0.053 | -0.049 | -0.002 | 0.003 | 0.003 | -0.027 | 0.009 | 1.000 | 0.223 | 0.013 | 0.042 | 0.071 | 0.042 | 0.019 | 0.030 | 0.026 | 0.002 | 0.285 | 0.095 | 0.010 | 0.229 | 0.244 | 0.244 | 0.152 | 1.000 | 0.877 | 0.118 | 0.195 | 0.073 | 0.073 | 0.065 |
| SUBAREA_ID | 0.004 | 0.031 | 0.031 | 0.062 | 0.027 | 0.032 | 0.062 | -0.025 | -0.028 | 0.036 | 0.031 | 0.043 | -0.004 | 0.106 | 0.223 | 1.000 | 0.006 | 0.028 | 0.047 | 0.018 | 0.006 | 0.025 | 0.012 | 0.001 | 0.157 | 0.022 | 0.004 | 0.156 | 0.144 | 0.144 | 0.121 | 0.775 | 0.996 | 0.110 | 0.199 | 0.012 | 0.012 | 0.049 |
| cod_final | 0.049 | 0.286 | 0.287 | 0.185 | 0.305 | 0.310 | 0.188 | -0.019 | -0.038 | 0.314 | 0.314 | 0.323 | -0.011 | 0.016 | 0.013 | 0.006 | 1.000 | 0.045 | 0.328 | 0.211 | 0.026 | 0.030 | 0.004 | 0.001 | 0.031 | 0.010 | 0.003 | 0.061 | 0.082 | 0.082 | 0.017 | 0.033 | 0.035 | 0.040 | 0.046 | 0.008 | 0.006 | 0.032 |
| PER_ID | 0.138 | 0.925 | 0.924 | 0.176 | 0.873 | 0.928 | 0.181 | 0.123 | 0.022 | 0.844 | 0.901 | 0.954 | 0.027 | 0.085 | 0.042 | 0.028 | 0.045 | 1.000 | 0.188 | 0.107 | 0.152 | 0.524 | 0.016 | 0.012 | 0.035 | 0.018 | 0.017 | 0.075 | 0.123 | 0.122 | 0.161 | 0.052 | 0.058 | 0.162 | 0.175 | 0.083 | 0.067 | 1.000 |
| INS_POBLACION | 0.062 | 0.206 | 0.198 | 0.071 | 0.186 | 0.186 | 0.069 | 0.057 | 0.022 | 0.174 | 0.177 | 0.216 | 0.022 | 0.090 | 0.071 | 0.047 | 0.328 | 0.188 | 1.000 | 1.000 | 0.076 | 0.096 | 0.002 | 0.005 | 0.027 | 0.006 | 0.004 | 0.106 | 0.120 | 0.120 | 0.034 | 0.075 | 0.078 | 0.070 | 0.089 | 0.013 | 0.013 | 0.189 |
| INS_TIPO_INSCRIPCION | 0.070 | 0.638 | 0.570 | 0.105 | 0.093 | 0.103 | 0.102 | 0.007 | 0.025 | 0.103 | 0.104 | 0.135 | 0.024 | 0.040 | 0.042 | 0.018 | 0.211 | 0.107 | 1.000 | 1.000 | 0.119 | 0.136 | 0.002 | 0.001 | 0.022 | 0.015 | 0.002 | 0.041 | 0.080 | 0.080 | 0.021 | 0.046 | 0.051 | 0.114 | 0.119 | 0.021 | 0.021 | 0.116 |
| SEGMENTO_ASPIRANTE | 0.061 | 0.149 | 0.164 | 0.127 | 0.140 | 0.140 | 0.125 | 0.067 | 0.021 | 0.134 | 0.133 | 0.146 | 0.021 | 0.025 | 0.019 | 0.006 | 0.026 | 0.152 | 0.076 | 0.119 | 1.000 | 0.866 | 0.005 | 0.001 | 0.010 | 0.057 | 0.001 | 0.023 | 0.031 | 0.031 | 0.012 | 0.027 | 0.032 | 0.089 | 0.097 | 0.133 | 0.133 | 0.154 |
| CAE_GRUPO | 0.069 | 0.331 | 0.335 | 0.138 | 0.316 | 0.347 | 0.140 | 0.086 | 0.028 | 0.423 | 0.392 | 0.348 | 0.024 | 0.065 | 0.030 | 0.025 | 0.030 | 0.524 | 0.096 | 0.136 | 0.866 | 1.000 | 0.007 | 0.012 | 0.033 | 0.068 | 0.004 | 0.048 | 0.067 | 0.067 | 0.162 | 0.028 | 0.026 | 0.057 | 0.063 | 0.149 | 0.149 | 0.271 |
| POS_PRIORIDAD | 0.017 | 0.016 | 0.016 | 0.016 | 0.025 | 0.025 | 0.016 | 0.067 | 0.014 | 0.017 | 0.016 | 0.016 | 0.021 | 0.021 | 0.026 | 0.012 | 0.004 | 0.016 | 0.002 | 0.002 | 0.005 | 0.007 | 1.000 | 0.000 | 0.017 | 0.008 | 0.002 | 0.019 | 0.017 | 0.017 | 0.010 | 0.028 | 0.046 | 0.020 | 0.029 | 0.006 | 0.037 | 0.068 |
| POS_ESTADO | 0.005 | 0.012 | 0.012 | 0.006 | 0.016 | 0.016 | 0.006 | 0.000 | 0.002 | 0.012 | 0.012 | 0.012 | 0.002 | 0.001 | 0.002 | 0.001 | 0.001 | 0.012 | 0.005 | 0.001 | 0.001 | 0.012 | 0.000 | 1.000 | 0.001 | 0.001 | 0.000 | 0.001 | 0.002 | 0.002 | 0.005 | 0.003 | 0.006 | 0.005 | 0.006 | 0.001 | 0.001 | 0.017 |
| IES_TIPO_IES | 0.029 | 0.037 | 0.035 | 0.145 | 0.028 | 0.112 | 0.143 | 0.005 | 0.948 | 0.057 | 0.057 | 0.170 | 0.530 | 0.017 | 0.285 | 0.157 | 0.031 | 0.035 | 0.027 | 0.022 | 0.010 | 0.033 | 0.017 | 0.001 | 1.000 | 0.198 | 0.002 | 0.359 | 0.338 | 0.338 | 0.971 | 0.310 | 0.459 | 0.213 | 0.403 | 0.092 | 0.092 | 0.061 |
| IES_TIPO_FINANCIAMIENTO | 0.018 | 0.019 | 0.023 | 0.020 | 0.032 | 0.047 | 0.020 | 0.022 | 0.471 | 0.024 | 0.021 | 0.052 | 0.104 | 0.057 | 0.095 | 0.022 | 0.010 | 0.018 | 0.006 | 0.015 | 0.057 | 0.068 | 0.008 | 0.001 | 0.198 | 1.000 | 0.012 | 0.093 | 0.095 | 0.095 | 0.165 | 0.127 | 0.131 | 0.089 | 0.167 | 0.764 | 0.764 | 0.054 |
| IES_ESTADO | 0.008 | 0.017 | 0.016 | 0.005 | 0.020 | 0.018 | 0.005 | 0.001 | 0.019 | 0.015 | 0.019 | 0.025 | 0.043 | 0.003 | 0.010 | 0.004 | 0.003 | 0.017 | 0.004 | 0.002 | 0.001 | 0.004 | 0.002 | 0.000 | 0.002 | 0.012 | 1.000 | 0.066 | 0.017 | 0.017 | 0.003 | 0.016 | 0.021 | 0.015 | 0.017 | 0.001 | 0.001 | 0.029 |
| MODALIDAD | 0.033 | 0.073 | 0.072 | 0.055 | 0.073 | 0.081 | 0.055 | 0.043 | 0.169 | 0.073 | 0.073 | 0.131 | 0.114 | 1.000 | 0.229 | 0.156 | 0.061 | 0.075 | 0.106 | 0.041 | 0.023 | 0.048 | 0.019 | 0.001 | 0.359 | 0.093 | 0.066 | 1.000 | 0.500 | 0.500 | 0.178 | 0.246 | 0.340 | 0.227 | 0.380 | 0.080 | 0.056 | 0.091 |
| JORNADA_ID | 0.110 | 0.134 | 0.135 | 0.130 | 0.110 | 0.122 | 0.126 | 0.071 | 0.169 | 0.107 | 0.111 | 0.232 | 0.141 | 0.577 | 0.244 | 0.144 | 0.082 | 0.123 | 0.120 | 0.080 | 0.031 | 0.067 | 0.017 | 0.002 | 0.338 | 0.095 | 0.017 | 0.500 | 1.000 | 1.000 | 0.165 | 0.261 | 0.276 | 0.419 | 0.545 | 0.083 | 0.083 | 0.140 |
| JORNADA | 0.110 | 0.133 | 0.135 | 0.130 | 0.109 | 0.122 | 0.126 | 0.043 | 0.170 | 0.106 | 0.111 | 0.231 | 0.140 | 0.577 | 0.244 | 0.144 | 0.082 | 0.122 | 0.120 | 0.080 | 0.031 | 0.067 | 0.017 | 0.002 | 0.338 | 0.095 | 0.017 | 0.500 | 1.000 | 1.000 | 0.165 | 0.261 | 0.276 | 0.419 | 0.545 | 0.083 | 0.059 | 0.142 |
| NIVEL | 0.033 | 0.153 | 0.153 | 0.097 | 0.144 | 0.171 | 0.095 | 0.037 | 0.466 | 0.166 | 0.163 | 0.207 | 0.298 | 0.020 | 0.152 | 0.121 | 0.017 | 0.161 | 0.034 | 0.021 | 0.012 | 0.162 | 0.010 | 0.005 | 0.971 | 0.165 | 0.003 | 0.178 | 0.165 | 0.165 | 1.000 | 0.183 | 0.261 | 0.119 | 0.212 | 0.117 | 0.084 | 0.169 |
| AREA_NOMBRE | 0.040 | 0.038 | 0.038 | 0.093 | 0.041 | 0.050 | 0.093 | 0.052 | 0.167 | 0.058 | 0.070 | 0.067 | 0.369 | 0.284 | 1.000 | 0.775 | 0.033 | 0.052 | 0.075 | 0.046 | 0.027 | 0.028 | 0.028 | 0.003 | 0.310 | 0.127 | 0.016 | 0.246 | 0.261 | 0.261 | 0.183 | 1.000 | 0.905 | 0.112 | 0.174 | 0.097 | 0.097 | 0.049 |
| SUBAREA_NOMBRE | 0.046 | 0.042 | 0.042 | 0.103 | 0.045 | 0.056 | 0.102 | 0.057 | 0.235 | 0.077 | 0.086 | 0.078 | 0.519 | 0.314 | 0.877 | 0.996 | 0.035 | 0.058 | 0.078 | 0.051 | 0.032 | 0.026 | 0.046 | 0.006 | 0.459 | 0.131 | 0.021 | 0.340 | 0.276 | 0.276 | 0.261 | 0.905 | 1.000 | 0.131 | 0.179 | 0.094 | 0.094 | 0.053 |
| PROVINCIA | 0.229 | 0.117 | 0.120 | 0.107 | 0.086 | 0.130 | 0.105 | 0.064 | 0.302 | 0.113 | 0.116 | 0.271 | 0.135 | 0.217 | 0.118 | 0.110 | 0.040 | 0.162 | 0.070 | 0.114 | 0.089 | 0.057 | 0.020 | 0.005 | 0.213 | 0.089 | 0.015 | 0.227 | 0.419 | 0.419 | 0.119 | 0.112 | 0.131 | 1.000 | 1.000 | 0.093 | 0.095 | 0.084 |
| CANTON | 0.233 | 0.126 | 0.129 | 0.111 | 0.100 | 0.145 | 0.110 | 0.087 | 0.498 | 0.135 | 0.133 | 0.339 | 0.206 | 0.463 | 0.195 | 0.199 | 0.046 | 0.175 | 0.089 | 0.119 | 0.097 | 0.063 | 0.029 | 0.006 | 0.403 | 0.167 | 0.017 | 0.380 | 0.545 | 0.545 | 0.212 | 0.174 | 0.179 | 1.000 | 1.000 | 0.197 | 0.162 | 0.100 |
| PRD_ID_SEGMENTO | 0.032 | 0.080 | 0.083 | 0.010 | 0.079 | 0.107 | 0.010 | 0.039 | 0.347 | 0.086 | 0.078 | 0.087 | 0.084 | 0.055 | 0.073 | 0.012 | 0.008 | 0.083 | 0.013 | 0.021 | 0.133 | 0.149 | 0.006 | 0.001 | 0.092 | 0.764 | 0.001 | 0.080 | 0.083 | 0.083 | 0.117 | 0.097 | 0.094 | 0.093 | 0.197 | 1.000 | 1.000 | 0.093 |
| SEGMETO_CARRERA | 0.045 | 0.065 | 0.066 | 0.010 | 0.077 | 0.107 | 0.010 | 0.452 | 0.242 | 0.067 | 0.078 | 0.071 | 0.060 | 0.055 | 0.073 | 0.012 | 0.006 | 0.067 | 0.013 | 0.021 | 0.133 | 0.149 | 0.037 | 0.001 | 0.092 | 0.764 | 0.001 | 0.056 | 0.083 | 0.059 | 0.084 | 0.097 | 0.094 | 0.095 | 0.162 | 1.000 | 1.000 | 0.710 |
| archivo | 0.252 | 0.618 | 0.617 | 0.136 | 0.800 | 0.893 | 0.137 | 1.000 | 0.055 | 0.689 | 0.682 | 0.637 | 0.050 | 0.116 | 0.065 | 0.049 | 0.032 | 1.000 | 0.189 | 0.116 | 0.154 | 0.271 | 0.068 | 0.017 | 0.061 | 0.054 | 0.029 | 0.091 | 0.140 | 0.142 | 0.169 | 0.049 | 0.053 | 0.084 | 0.100 | 0.093 | 0.710 | 1.000 |
| Unnamed: 0 | INS_ID | INI_ID | PER_ID | INS_POBLACION | INS_TIPO_INSCRIPCION | SEGMENTO_ASPIRANTE | CAE_GRUPO | CAE_ESTADO | CAE_NOTA_POSTULA | POS_ID | POS_FECHA_POSTULACION | CUS_ID | NOTA_POSTULA | PRD_ID_NUM_POSTULACION | POS_PRIORIDAD | POS_ESTADO | IES_ID | IES_NOMBRE_INSTIT | IES_TIPO_IES | IES_TIPO_FINANCIAMIENTO | OFA_ID | IES_ESTADO | APC_ID | CCP_ID | CAR_ID | CAR_NOMBRE_CARRERA | MODALIDAD_ID | MODALIDAD | JORNADA_ID | JORNADA | NIVEL | AREA_ID | AREA_NOMBRE | SUBAREA_ID | SUBAREA_NOMBRE | PROVINCIA | CANTON | PARROQUIA | CAM_NOMBRE_CAMPUS | PRD_ID_SEGMENTO | SEGMETO_CARRERA | cod_final | archivo | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 12263583.0 | 7459572.0 | 22 | NaN | 3.0 | POBLACION GENERAL | POBLACION GENERAL | 1.0 | 714.0 | 29103938.0 | 27/10/2021 15:27 | 311701.0 | 714.0 | 494 | 2 | 1.0 | 46 | UNIVERSIDAD CENTRAL DEL ECUADOR | U | PÚBLICA | 177772 | A | 45466.0 | 31979 | 5455 | ADMINISTRACION DE EMPRESAS | 8.0 | DISTANCIA | 3.0 | NO APLICA JORNADA | TERCER NIVEL | 9.0 | ADMINISTRACION | 500.0 | EDUCACION COMERCIAL Y ADMINISTRACION | PICHINCHA | DISTRITO METROPOLITANO DE QUITO | QUITO DISTRITO METROPOLITANO, CABECERA CANTONAL, CAPITAL PROVINCIAL Y DE LA REPUBLICA DEL ECUADOR | MATRIZ - QUITO | 800.0 | OFERTA PÚBLICA | 4896670101 | tercera_postulacion_per22.csv |
| 1 | 2 | 12263583.0 | 7459572.0 | 22 | NaN | 3.0 | POBLACION GENERAL | POBLACION GENERAL | 1.0 | 714.0 | 29103941.0 | 27/10/2021 15:27 | 311785.0 | 714.0 | 494 | 5 | 1.0 | 59 | UNIVERSIDAD ESTATAL DE MILAGRO | U | PÚBLICA | 174130 | A | 47779.0 | 30194 | 5205 | COMUNICACION | 441.0 | EN LINEA | 3.0 | NO APLICA JORNADA | TERCER NIVEL | 13.0 | CIENCIAS SOCIALES, PERIODISMO, INFORMACION Y DERECHO | 15.0 | PERIODISMO E INFORMACION | GUAYAS | MILAGRO | MILAGRO, CABECERA CANTONAL | MATRIZ - MILAGRO | 800.0 | OFERTA PÚBLICA | 4896670101 | tercera_postulacion_per22.csv |
| 2 | 3 | 12263583.0 | 7459572.0 | 22 | NaN | 3.0 | POBLACION GENERAL | POBLACION GENERAL | 1.0 | 714.0 | 29103939.0 | 27/10/2021 15:27 | 312325.0 | 714.0 | 494 | 3 | 1.0 | 72 | UNIVERSIDAD NACIONAL DE LOJA | U | PÚBLICA | 175203 | A | 45847.0 | 29748 | 4781 | DERECHO | 8.0 | DISTANCIA | 3.0 | NO APLICA JORNADA | TERCER NIVEL | 13.0 | CIENCIAS SOCIALES, PERIODISMO, INFORMACION Y DERECHO | 510.0 | DERECHO | LOJA | LOJA | LOJA, CABECERA CANTONAL Y CAPITAL PROVINCIAL | MATRIZ - LOJA | 800.0 | OFERTA PÚBLICA | 4896670101 | tercera_postulacion_per22.csv |
| 3 | 4 | 12263583.0 | 7459572.0 | 22 | NaN | 3.0 | POBLACION GENERAL | POBLACION GENERAL | 1.0 | 714.0 | 29103937.0 | 27/10/2021 15:27 | 312048.0 | 714.0 | 494 | 1 | 1.0 | 72 | UNIVERSIDAD NACIONAL DE LOJA | U | PÚBLICA | 180652 | A | 48104.0 | 29743 | 5205 | COMUNICACION | 8.0 | DISTANCIA | 3.0 | NO APLICA JORNADA | TERCER NIVEL | 13.0 | CIENCIAS SOCIALES, PERIODISMO, INFORMACION Y DERECHO | 15.0 | PERIODISMO E INFORMACION | LOJA | LOJA | LOJA, CABECERA CANTONAL Y CAPITAL PROVINCIAL | MATRIZ - LOJA | 800.0 | OFERTA PÚBLICA | 4896670101 | tercera_postulacion_per22.csv |
| 4 | 5 | 12263583.0 | 7459572.0 | 22 | NaN | 3.0 | POBLACION GENERAL | POBLACION GENERAL | 1.0 | 714.0 | 29103940.0 | 27/10/2021 15:27 | 312305.0 | 714.0 | 494 | 4 | 1.0 | 72 | UNIVERSIDAD NACIONAL DE LOJA | U | PÚBLICA | 177913 | A | 47704.0 | 29754 | 4473 | EDUCACION INICIAL | 8.0 | DISTANCIA | 3.0 | NO APLICA JORNADA | TERCER NIVEL | 14.0 | EDUCACION | 511.0 | EDUCACION | LOJA | LOJA | LOJA, CABECERA CANTONAL Y CAPITAL PROVINCIAL | MATRIZ - LOJA | 800.0 | OFERTA PÚBLICA | 4896670101 | tercera_postulacion_per22.csv |
| 5 | 6 | 11563446.0 | 6904301.0 | 22 | No escolar | 1.0 | POLITICA DE ACCION AFIRMATIVA | /POLITICA DE ACCION AFIRMATIVA | 1.0 | 774.0 | 29060047.0 | 27/10/2021 13:13 | 312155.0 | 774.0 | 494 | 1 | 1.0 | 48 | UNIVERSIDAD DE CUENCA | U | PÚBLICA | 177897 | A | 45489.0 | 31766 | 5087 | DISEÑO DE INTERIORES | 9.0 | PRESENCIAL | 1.0 | INTENSIVA | TERCER NIVEL | 11.0 | ARTES Y HUMANIDADES | 504.0 | ARTES | AZUAY | CUENCA | CUENCA, CABECERA CANTONAL Y CAPITAL PROVINCIAL. | MATRIZ - AZUAY. | 800.0 | OFERTA PÚBLICA | 1817840156 | tercera_postulacion_per22.csv |
| 6 | 7 | 11563446.0 | 6904301.0 | 22 | No escolar | 1.0 | POLITICA DE ACCION AFIRMATIVA | /POLITICA DE ACCION AFIRMATIVA | 1.0 | 774.0 | 29060049.0 | 27/10/2021 13:13 | 311927.0 | 774.0 | 494 | 3 | 1.0 | 48 | UNIVERSIDAD DE CUENCA | U | PÚBLICA | 179112 | A | 46911.0 | 31767 | 5167 | DISEÑO GRAFICO | 9.0 | PRESENCIAL | 1.0 | INTENSIVA | TERCER NIVEL | 11.0 | ARTES Y HUMANIDADES | 504.0 | ARTES | AZUAY | CUENCA | CUENCA, CABECERA CANTONAL Y CAPITAL PROVINCIAL. | MATRIZ - AZUAY. | 800.0 | OFERTA PÚBLICA | 1817840156 | tercera_postulacion_per22.csv |
| 7 | 8 | 11563446.0 | 6904301.0 | 22 | No escolar | 1.0 | POLITICA DE ACCION AFIRMATIVA | /POLITICA DE ACCION AFIRMATIVA | 1.0 | 774.0 | 29060048.0 | 27/10/2021 13:13 | 312375.0 | 774.0 | 494 | 2 | 1.0 | 48 | UNIVERSIDAD DE CUENCA | U | PÚBLICA | 172459 | A | 45840.0 | 31771 | 5029 | ELECTRICIDAD | 9.0 | PRESENCIAL | 1.0 | INTENSIVA | TERCER NIVEL | 26.0 | INGENIERIA, INDUSTRIA Y CONSTRUCCION | 63.0 | INGENIERIA Y PROFESIONES AFINES | AZUAY | CUENCA | CUENCA, CABECERA CANTONAL Y CAPITAL PROVINCIAL. | MATRIZ - AZUAY. | 800.0 | OFERTA PÚBLICA | 1817840156 | tercera_postulacion_per22.csv |
| 8 | 9 | 11704080.0 | 6974616.0 | 22 | NaN | 1.0 | POBLACION GENERAL | POBLACION GENERAL | 1.0 | 754.0 | 29251832.0 | 28/10/2021 10:24 | 312319.0 | 754.0 | 494 | 1 | 1.0 | 86 | UNIVERSIDAD TECNICA DE MANABI | U | PÚBLICA | 179253 | A | 45494.0 | 29284 | 4781 | DERECHO | 441.0 | EN LINEA | 3.0 | NO APLICA JORNADA | TERCER NIVEL | 13.0 | CIENCIAS SOCIALES, PERIODISMO, INFORMACION Y DERECHO | 510.0 | DERECHO | MANABI | PORTOVIEJO | PORTOVIEJO | MATRIZ - PORTOVIEJO | 800.0 | OFERTA PÚBLICA | 2734800183 | tercera_postulacion_per22.csv |
| 9 | 10 | 11704080.0 | 6974616.0 | 22 | NaN | 1.0 | POBLACION GENERAL | POBLACION GENERAL | 1.0 | 754.0 | 29251833.0 | 28/10/2021 10:24 | 312503.0 | 754.0 | 494 | 2 | 1.0 | 59 | UNIVERSIDAD ESTATAL DE MILAGRO | U | PÚBLICA | 176762 | A | 47778.0 | 30197 | 4781 | DERECHO | 441.0 | EN LINEA | 3.0 | NO APLICA JORNADA | TERCER NIVEL | 13.0 | CIENCIAS SOCIALES, PERIODISMO, INFORMACION Y DERECHO | 510.0 | DERECHO | GUAYAS | MILAGRO | MILAGRO, CABECERA CANTONAL | MATRIZ - MILAGRO | 800.0 | OFERTA PÚBLICA | 2734800183 | tercera_postulacion_per22.csv |
| Unnamed: 0 | INS_ID | INI_ID | PER_ID | INS_POBLACION | INS_TIPO_INSCRIPCION | SEGMENTO_ASPIRANTE | CAE_GRUPO | CAE_ESTADO | CAE_NOTA_POSTULA | POS_ID | POS_FECHA_POSTULACION | CUS_ID | NOTA_POSTULA | PRD_ID_NUM_POSTULACION | POS_PRIORIDAD | POS_ESTADO | IES_ID | IES_NOMBRE_INSTIT | IES_TIPO_IES | IES_TIPO_FINANCIAMIENTO | OFA_ID | IES_ESTADO | APC_ID | CCP_ID | CAR_ID | CAR_NOMBRE_CARRERA | MODALIDAD_ID | MODALIDAD | JORNADA_ID | JORNADA | NIVEL | AREA_ID | AREA_NOMBRE | SUBAREA_ID | SUBAREA_NOMBRE | PROVINCIA | CANTON | PARROQUIA | CAM_NOMBRE_CAMPUS | PRD_ID_SEGMENTO | SEGMETO_CARRERA | cod_final | archivo | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6575946 | 676 | 7775969.0 | 4406230.0 | 18 | NaN | NaN | NaN | NaN | NaN | NaN | 18511721.0 | NaN | NaN | NaN | 561 | 1 | NaN | 22 | NaN | NaN | NaN | 103621 | NaN | NaN | 20413 | 4911 | NaN | NaN | EN LINEA | NaN | NO APLICA JORNADA | TERCER NIVEL | NaN | NaN | NaN | NaN | PICHINCHA | RUMIÑAHUI | SANGOLQUÍ | NaN | NaN | POBLACION GENERAL | 1742482301 | cuarta_asigna_directa_per18.csv |
| 6575947 | 677 | 7954502.0 | 4643575.0 | 18 | NaN | NaN | NaN | NaN | NaN | NaN | 18511324.0 | NaN | NaN | NaN | 561 | 1 | NaN | 22 | NaN | NaN | NaN | 103621 | NaN | NaN | 20413 | 4911 | NaN | NaN | EN LINEA | NaN | NO APLICA JORNADA | TERCER NIVEL | NaN | NaN | NaN | NaN | PICHINCHA | RUMIÑAHUI | SANGOLQUÍ | NaN | NaN | POBLACION GENERAL | 2003922310 | cuarta_asigna_directa_per18.csv |
| 6575948 | 678 | 7779622.0 | 4419043.0 | 18 | NaN | NaN | NaN | NaN | NaN | NaN | 18511726.0 | NaN | NaN | NaN | 561 | 1 | NaN | 22 | NaN | NaN | NaN | 102184 | NaN | NaN | 20411 | 4618 | NaN | NaN | EN LINEA | NaN | NO APLICA JORNADA | TERCER NIVEL | NaN | NaN | NaN | NaN | PICHINCHA | RUMIÑAHUI | SANGOLQUÍ | NaN | NaN | POBLACION GENERAL | 1748602338 | cuarta_asigna_directa_per18.csv |
| 6575949 | 679 | 7946715.0 | 4580788.0 | 18 | NaN | NaN | NaN | NaN | NaN | NaN | 18511428.0 | NaN | NaN | NaN | 561 | 1 | NaN | 22 | NaN | NaN | NaN | 95017 | NaN | NaN | 20403 | 4473 | NaN | NaN | EN LINEA | NaN | NO APLICA JORNADA | TERCER NIVEL | NaN | NaN | NaN | NaN | PICHINCHA | RUMIÑAHUI | SANGOLQUÍ | NaN | NaN | POBLACION GENERAL | 2100392356 | cuarta_asigna_directa_per18.csv |
| 6575950 | 680 | 7473282.0 | 4174454.0 | 18 | NaN | NaN | NaN | NaN | NaN | NaN | 18511389.0 | NaN | NaN | NaN | 561 | 1 | NaN | 493 | NaN | NaN | NaN | 99593 | NaN | NaN | 20138 | 7100 | NaN | NaN | PRESENCIAL | NaN | MATUTINA | TECNOLOGICO SUPERIOR | NaN | NaN | NaN | NaN | SANTO DOMINGO DE LOS TSACHILAS | SANTO DOMINGO | SANTO DOMINGO DE LOS COLORADOS | NaN | NaN | POBLACION GENERAL | 2067762374 | cuarta_asigna_directa_per18.csv |
| 6575951 | 681 | 7493865.0 | 4184775.0 | 18 | NaN | NaN | NaN | NaN | NaN | NaN | 18511460.0 | NaN | NaN | NaN | 561 | 1 | NaN | 30 | NaN | NaN | NaN | 98801 | NaN | NaN | 19896 | 4533 | NaN | NaN | PRESENCIAL | NaN | NOCTURNA | TERCER NIVEL | NaN | NaN | NaN | NaN | MANABI | BOLIVAR | CALCETA | NaN | NaN | POBLACION GENERAL | 1320052301 | cuarta_asigna_directa_per18.csv |
| 6575952 | 682 | 7749169.0 | 4479372.0 | 18 | NaN | NaN | NaN | NaN | NaN | NaN | 18511504.0 | NaN | NaN | NaN | 561 | 1 | NaN | 22 | NaN | NaN | NaN | 95017 | NaN | NaN | 20403 | 4473 | NaN | NaN | EN LINEA | NaN | NO APLICA JORNADA | TERCER NIVEL | NaN | NaN | NaN | NaN | PICHINCHA | RUMIÑAHUI | SANGOLQUÍ | NaN | NaN | POBLACION GENERAL | 1695572383 | cuarta_asigna_directa_per18.csv |
| 6575953 | 683 | 7794139.0 | 4449302.0 | 18 | NaN | NaN | NaN | NaN | NaN | NaN | 18511405.0 | NaN | NaN | NaN | 561 | 1 | NaN | 22 | NaN | NaN | NaN | 103621 | NaN | NaN | 20413 | 4911 | NaN | NaN | EN LINEA | NaN | NO APLICA JORNADA | TERCER NIVEL | NaN | NaN | NaN | NaN | PICHINCHA | RUMIÑAHUI | SANGOLQUÍ | NaN | NaN | POBLACION GENERAL | 1772182365 | cuarta_asigna_directa_per18.csv |
| 6575954 | 684 | 7797070.0 | 4431939.0 | 18 | NaN | NaN | NaN | NaN | NaN | NaN | 18511635.0 | NaN | NaN | NaN | 561 | 1 | NaN | 22 | NaN | NaN | NaN | 95017 | NaN | NaN | 20403 | 4473 | NaN | NaN | EN LINEA | NaN | NO APLICA JORNADA | TERCER NIVEL | NaN | NaN | NaN | NaN | PICHINCHA | RUMIÑAHUI | SANGOLQUÍ | NaN | NaN | POBLACION GENERAL | 1776772310 | cuarta_asigna_directa_per18.csv |
| 6575955 | 685 | 7309997.0 | 4092839.0 | 18 | NaN | NaN | NaN | NaN | NaN | NaN | 18511156.0 | NaN | NaN | NaN | 561 | 1 | NaN | 59 | NaN | NaN | NaN | 99032 | NaN | NaN | 20897 | 4618 | NaN | NaN | EN LINEA | NaN | NO APLICA JORNADA | TERCER NIVEL | NaN | NaN | NaN | NaN | GUAYAS | MILAGRO | MILAGRO, CABECERA CANTONAL | NaN | NaN | POBLACION GENERAL | 4884592474 | cuarta_asigna_directa_per18.csv |